Scala for Data Scientists
The Spark Programming Model
Record Linkage
Getting Started: The Spark Shell and SparkContext
Bringing Data from the Cluster to the Client
Shipping Code from the Client to the Cluster
From RDDs to Data Frames
Analyzing Data with the DataFrame API
Fast Summary Statistics for DataFrames
Pivoting and Reshaping DataFrames
Joining DataFrames and Selecting Features
Preparing Models for Production Environments
Model Evaluation
Where to Go from Here