July 2017
Intermediate to advanced
796 pages
18h 55m
English
RDDs are immutable, so every transformation creates a new RDD rather than modifying an existing one. The two main kinds of operations that you can perform on an RDD are Transformations and Actions.
Transformations derive a new RDD from the elements of an existing one, for example by splitting input elements, filtering out elements, or performing calculations of some sort. Several transformations can be chained in sequence; however, no execution takes place while they are being declared. Spark only records them as a plan, and the work runs when an action is called.
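The idea of transformations being recorded first and executed only on an action can be sketched with a small toy model. This is not Spark's API: the `ToyRDD` class, its `flat_map`, `filter`, `map`, and `collect` methods, and the sample data are all hypothetical, written in plain Python purely to illustrate immutability and deferred execution.

```python
from itertools import chain


class ToyRDD:
    """Hypothetical stand-in for an RDD (illustration only, not Spark)."""

    def __init__(self, data, plan=()):
        self._data = tuple(data)  # source elements, never modified
        self._plan = plan         # recorded transformations, not yet run

    # Transformations: each returns a NEW ToyRDD with one more planned step.
    def map(self, fn):
        return ToyRDD(self._data, self._plan + (("map", fn),))

    def filter(self, pred):
        return ToyRDD(self._data, self._plan + (("filter", pred),))

    def flat_map(self, fn):
        return ToyRDD(self._data, self._plan + (("flat_map", fn),))

    # Action: only here are the planned steps actually executed.
    def collect(self):
        items = iter(self._data)
        for kind, fn in self._plan:
            if kind == "map":
                items = map(fn, items)
            elif kind == "filter":
                items = filter(fn, items)
            else:  # flat_map: apply fn, then flatten the results
                items = chain.from_iterable(map(fn, items))
        return list(items)


calls = []  # records every time the splitting function really runs


def split_words(line):
    calls.append(line)
    return line.split()


lines = ToyRDD(["spark is lazy", "rdds are immutable"])
words = lines.flat_map(split_words)               # planned, not executed
long_words = words.filter(lambda w: len(w) > 4)   # planned, not executed
upper = long_words.map(str.upper)                 # planned, not executed

calls_before_action = list(calls)  # still empty: nothing has run yet

result = upper.collect()           # the action triggers execution
print(result)                      # ['SPARK', 'IMMUTABLE']
```

Each transformation returns a new object and leaves `lines` untouched, and `split_words` is not called until `collect()` runs. A real engine can use the recorded plan to decide how to execute the steps; this toy simply runs them in order.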
The reasoning behind the lazy evaluation is that Spark can look at all the transformations and plan the execution, making use of the understanding ...