References
The references are listed here:
- http://spark-project.org/docs/latest/scala-programming-guide.html#hadoop-datasets
- http://opencsv.sourceforge.net/
- http://commons.apache.org/proper/commons-csv/
- http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapred/SequenceFileInputFormat.html
- http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapred/InputFormat.html
- http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
- http://spark.apache.org/docs/latest/api/python/
- http://wiki.apache.org/hadoop/SequenceFile
- http://hbase.apache.org/book/quickstart.html
- http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/TableInputFormat.html
- https://spark.apache.org/docs/latest/api/java/org/apache/spark/api/java/JavaPairRDD.html ...
Get Fast Data Processing with Spark 2 - Third Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.