Apache Spark Quick Start Guide

by Shrey Mehrotra, Akash Grade
January 2019
Beginner to intermediate
154 pages
4h 31m
English
Packt Publishing

Overview

Dive into the world of scalable data processing with the "Apache Spark Quick Start Guide." This book offers a foundational introduction to Spark, empowering readers to harness its capabilities for big data processing. With clear explanations and hands-on examples, you'll learn to implement Spark applications that handle complex data tasks efficiently.

What this book will help me do

  • Understand and implement Spark's RDDs and DataFrame APIs to process large datasets effectively.
  • Set up a local development environment for Spark-based projects.
  • Develop skills to debug and optimize slow-performing Spark applications.
  • Harness built-in modules of Spark for SQL, streaming, and machine learning applications.
  • Adopt best practices and optimization techniques for high-performance Spark applications.

Author(s)

Shrey Mehrotra is a seasoned software developer with expertise in big data technologies, particularly Apache Spark. With years of hands-on industry experience, Shrey focuses on making complex technical concepts accessible to all. Through his writing, he aims to share clear, practical guidance for developers of all levels.

Who is it for?

This guide is for big data enthusiasts and professionals who want to learn Apache Spark's capabilities from scratch. It's aimed at data engineers interested in optimizing application performance and data scientists wanting to integrate machine learning with Spark. A basic familiarity with Scala, Python, or Java is recommended.

You might also like

  • Debugging Apache Spark, by Holden Karau
  • Apache Spark 2.x for Java Developers, by Sumit Kumar, Sourav Gulati

Publisher Resources

ISBN: 9781789349108