Aggregations
cubes
flight summary dataset
functions
approx_count_distinct (col)
avg(col)
count(col)
countDistinct(col)
description
min(col), max(col)
Scala language
skewness(col), kurtosis(col)
sum(col)
sumDistinct(col)
variance(col), stddev(col)
grouping
categorical values
collection group values
multiple aggregations
origin_airport and Count Aggregation
origin_state and origin_city, Count Aggregation
RelationalGroupedDataset
levels
operations
pivoting
rollups
state
time windows
Alternate-least-square (ALS) algorithm
Arbitrary stateful processing
action
flatMapGroupsWithState
handling state timeouts
mapGroupsWithState
structured streaming
Artificial intelligence (AI)