July 2017
Intermediate to advanced
796 pages
18h 55m
English
Most aggregations can be done using functions that can be found in the org.apache.spark.sql.functions package. In addition, custom aggregation functions can also be created, also known as User Defined Aggregation Functions (UDAF).
We will load the sample data to illustrate all the different types of aggregate functions in this section:
val statesPopulationDF = spark.read.option("header", "true").option("inferschema", "true").option("sep", ",").csv("statesPopulation.csv")
Read now
Unlock full access