March 2019
Beginner to intermediate
182 pages
4h 6m
English
In this chapter, we will look at the immutable design of Apache Spark. We will delve into the parent/child chain of Spark RDDs and use RDDs in an immutable way. We will then use DataFrame transformations to discuss immutability in a highly concurrent environment. By the end of this chapter, we will have used the Dataset API in an immutable way.
In this chapter, we will cover the following topics: