June 2018
Beginner to intermediate
320 pages
10h 1m
English
Spark is a first-class data processing platform and programming interface for Big Data, and it is inextricably linked to the Big Data technology wave. At the time of this writing, Spark is one of the most active open source projects under the Apache Software Foundation (ASF), and one of the most active open source Big Data projects ever.
With so much interest in Spark from the analytics, data processing, and data science communities, it’s important to understand what Spark is, what purpose it serves, what advantages it provides, and how to leverage Spark for Big Data analytics. This book covers all that.
Unlike many other publications dedicated to Spark, which almost exclusively use the Scala API, this book focuses on ...