4

Learning Spark Programming Basics

Talk is cheap. Show me the code.

Linus Torvalds, Finnish-American creator of Linux

In This Chapter:

Resilient Distributed Datasets (RDDs)

How to load data into Spark RDDs

Transformation and actions on RDDs

How to perform operations on multiple RDDs

Now that we’ve covered Spark’s runtime architecture and how ...

Get Data Analytics with Spark Using Python, First edition now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.