Index
A
Access control
cluster/pool/jobs
personal
table
workspace
Access Control Lists (ACLs)
agg function
ALTER command
Amazon Web Services (AWS)
Apache Spark
Apache Software Foundation
architecture
See Spark architecture
cluster/parallel processing solutions
components
Databricks moon
data challenges
definition
large-scale analytics
scalability
SQL and DataFrame
Apache Spark DataFrame
Apache Spark/traditional software
approxQuantile
Apress
Authentication Mechanism
B
Bits and pieces
creating data
frequent pattern growth
MLlib
parsing results
preparing data
running algorithm
C
Cleaning/transforming data
caching data
columns
createDataFrame procedure
data compression
explode command
extreme values
fillna command
isNull
isNull, dropna
lambda functions
lazy PySpark
pivoting
removing duplicates
Cloud ...

Get Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud now with the O’Reilly learning platform.
