Skip to Content
Data Analytics with Spark Using Python, First edition
book

Data Analytics with Spark Using Python, First edition

by Jeffrey Aven
June 2018
Beginner to intermediate content levelBeginner to intermediate
320 pages
10h 1m
English
Addison-Wesley Professional
Content preview from Data Analytics with Spark Using Python, First edition

Introduction

Spark is a first-class data processing platform and programming interface for Big Data which is inexorably linked to the Big Data technology wave. At the time of this writing, Spark is one of the most active open source projects under the Apache Software Foundation (ASF) framework, and it’s one of the most active open source Big Data projects ever.

With so much interest in Spark from the analytics, data processing, and data science communities, it’s important to understand what Spark is, what purpose it serves, what advantages it provides, and how to leverage Spark for Big Data analytics. This book covers all that.

Unlike many other publications dedicated to Spark, which almost exclusively use the Scala API, this book focuses on ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Foundational Python for Data Science

Foundational Python for Data Science

Kennedy Behrman
Scala and Spark for Big Data Analytics

Scala and Spark for Big Data Analytics

Sridhar Alla, Md. Rezaul Karim

Publisher Resources

ISBN: 9780134844855