Index
A
Apache Spark
installation
master UI
persisting RDD
prerequisites
Scala code
storage levels
Apache Zookeeper
Application programming interfaces (APIs)
B
Batch processing
C
Currying function
D
dapplyCollect function
dapply function
Data analytics project architecture
components
data ingestion
processing data
stages
storage
visualization
DataFrames
creation
JSON content
show() method
operations
filter() transformation
groupBy() transformation
select() transformation
view, creation
Data ingestion
Data processing
Datasets
BookDetails.json
operations
reflection-based approach
class attributes
DataFrame, creation
RDD, creation
schema creation
Data storage
Data streaming
Decision tree regression model
creation
predict method
SparkDataFrame
spark.decisionTree {SparkR}
Directed Acyclic Graph (DAG) ...