March 2019
Beginner to intermediate
182 pages
4h 6m
English
Transformations and actions are the main building blocks of an Apache Spark program. In this chapter, we will look at Spark transformations to defer computations and then look at which transformations should be avoided. We will then use the reduce and reduceByKey methods to carry out calculations from a dataset. We will then perform actions that trigger actual computations on graphs. By the end of this chapter, we will also have learned how to reuse the same rdd for different actions.
In this chapter, we will cover the following topics:
Read now
Unlock full access