Data Analytics with Spark Using Python, First edition

by Jeffrey Aven
June 2018
Content level: Beginner to intermediate
320 pages
10h 1m
English
Addison-Wesley Professional
Content preview from Data Analytics with Spark Using Python, First edition

Chapter 4: Learning Spark Programming Basics

Talk is cheap. Show me the code.

Linus Torvalds, Finnish-American creator of Linux

In This Chapter:

Resilient Distributed Datasets (RDDs)

How to load data into Spark RDDs

Transformations and actions on RDDs

How to perform operations on multiple RDDs

Now that we’ve covered Spark’s runtime architecture and how ...


Publisher Resources

ISBN: 9780134844855