Why data analytics on big data?
Big data for analytics
Big data – a bigger pay package for Java developers
Basics of Hadoop – a Java sub-project
Distributed computing on Hadoop
HDFS concepts
Design and architecture of HDFS
Main components of HDFS
HDFS simple commands
Apache Spark
Concepts
Transformations
Actions
Spark Java API
Spark samples using Java 8
Loading data
Data operations – cleansing and munging
Analyzing data – count, projection, grouping, aggregation, and max/min
Actions on RDDs
Paired RDDs
Transformations on paired RDDs
Saving data
Collecting and printing results
Executing Spark programs on Hadoop
Apache Spark sub-projects
Spark machine learning modules
MLlib Java API
Other machine learning libraries
Mahout – a popular Java ML library
Deeplearning4j – a deep learning library
Compressing data
Avro and Parquet