Skip to Content
Apache Spark for Data Science Cookbook
book

Apache Spark for Data Science Cookbook

by Padma Priya Chitturi
December 2016
Beginner to intermediate
392 pages
8h 13m
English
Packt Publishing

Overview

In "Apache Spark for Data Science Cookbook," you'll delve into solving real-world analytical challenges using the robust Apache Spark framework. This book features hands-on recipes that cover data analysis, distributed machine learning, and real-time data processing. You'll gain practical skills to process, visualize, and extract insights from large datasets efficiently.

What this Book will help me do

  • Master using Apache Spark for processing and analyzing large-scale datasets effectively.
  • Harness Spark's MLLib for implementing machine learning algorithms like classification and clustering.
  • Utilize libraries such as NumPy, SciPy, and Pandas in conjunction with Spark for numerical computations.
  • Apply techniques like Natural Language Processing and text mining using Spark-integrated tools.
  • Perform end-to-end data science workflows, including data exploration, modeling, and visualization.

Author(s)

Nagamallikarjuna Inelu and None Chitturi bring their extensive experience working with data science and distributed computing frameworks like Apache Spark. Nagamallikarjuna specializes in applying machine learning algorithms to big data problems, while None has contributed to various big data system implementations. Together, they focus on providing practitioners with practical and efficient solutions.

Who is it for?

This book is primarily intended for novice and intermediate data scientists and analysts who are curious about using Apache Spark to tackle data science problems. Readers are expected to have some familiarity with basic data science tasks. If you want to learn practical applications of Spark in data analysis and enhance your big data analytics skills, this resource is for you.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Apache Spark Deep Learning Cookbook

Apache Spark Deep Learning Cookbook

Ahmed Sherif, Amrith Ravindra, Michal Malohlava, Adnan Masood
Spark Cookbook

Spark Cookbook

Rishi Yadav

Publisher Resources

ISBN: 9781785880100Supplemental Content