4

Learning Spark Programming Basics

Talk is cheap. Show me the code.

Linus Torvalds, Finnish-American creator of Linux

In This Chapter:

Resilient Distributed Datasets (RDDs)

How to load data into Spark RDDs

Transformation and actions on RDDs

How to perform operations on multiple RDDs

Now that we’ve covered Spark’s runtime architecture and how ...

Get Data Analytics with Spark Using Python, First edition now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.