March 2019
Beginner to intermediate
182 pages
4h 6m
English
In this chapter, we'll be working with the Spark key/value API. We will start by looking at the available transformations on key/value pairs. We will then learn how to use the aggregateByKey method instead of the groupBy() method. Later, we'll be looking at actions on key/value pairs and looking at the available partitioners on key/value data. At the end of this chapter, we'll be implementing an advanced partitioner that will be able to partition our data by range.
In this chapter, we will be covering the following topics:
Read now
Unlock full access