Book description
This book is a step-by-step guide for learning how to use Spark for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. It covers Spark core and its add-on libraries, including Spark SQL, Spark Streaming, GraphX, MLlib, and Spark ML.
Big Data Analytics with Spark shows you how to use Spark and leverage its easy-to-use features to increase your productivity. You learn to perform fast data analysis using its in-memory caching and advanced execution engine, employ in-memory computing capabilities for building high-performance machine learning and low-latency interactive analytics applications, and much more. Moreover, the book shows you how to use Spark as a single integrated platform for a variety of data processing tasks, including ETL pipelines, BI, live data stream processing, graph analytics, and machine learning.
The book also includes a chapter on Scala, the hottest functional programming language, and the language that underlies Spark. You’ll learn the basics of functional programming in Scala, so that you can write Spark applications in it.
What's more, Big Data Analytics with Spark provides an introduction to other big data technologies that are commonly used along with Spark, such as HDFS, Avro, Parquet, Kafka, Cassandra, HBase, Mesos, and so on. It also provides an introduction to machine learning and graph concepts. So the book is self-sufficient; all the technologies that you need to know to use Spark are covered. The only thing that you are expected to have is some programming knowledge in any language.
Table of contents
- Cover
- Title
- Copyright
- Dedication
- Contents at a Glance
- Contents
- About the Author
- About the Technical Reviewers
- Acknowledgments
- Introduction
- Chapter 1 : Big Data Technology Landscape
- Chapter 2 : Programming in Scala
- Chapter 3 : Spark Core
- Chapter 4 : Interactive Data Analysis with Spark Shell
- Chapter 5 : Writing a Spark Application
- Chapter 6 : Spark Streaming
- Chapter 7 : Spark SQL
- Chapter 8 : Machine Learning with Spark
- Chapter 9 : Graph Processing with Spark
- Chapter 10 : Cluster Managers
- Chapter 11 : Monitoring
- Bibliography
- Index
Product information
- Title: Big Data Analytics with Spark: A Practitioner’s Guide to Using Spark for Large-Scale Data Processing, Machine Learning, and Graph Analytics, and High-Velocity Data Stream Processing
- Author(s):
- Release date: January 2016
- Publisher(s): Apress
- ISBN: 9781484209646
You might also like
book
Scala Programming for Big Data Analytics : Get Started With Big Data Analytics Using Apache Spark
Gain the key language concepts and programming techniques of Scala in the context of big data …
book
Data Analytics with Spark Using Python, First edition
Spark for Data Professionals introduces and solidifies the concepts behind Spark 2.x, teaching working developers, architects, …
book
Scala and Spark for Big Data Analytics
Harness the power of Scala to program Spark and analyze tonnes of data in the blink …
book
Practical Big Data Analytics
Get command of your organizational Big Data using the power of data science and analytics About …