Advanced Programming Using the Spark Core API

Technology feeds on itself. Technology makes more technology possible.

Alvin Toffler, American writer and futurist

In This Chapter:

Introduction to shared variables (broadcast variables and accumulators) in Spark

Partitioning and repartitioning of Spark RDDs

Storage options for RDDs

Caching, distributed ...

Get Data Analytics with Spark Using Python, First edition now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.