Book description
- Understand Spark's unified data processing platform
- Learn how to run Spark in the Spark shell or on Databricks
- Use and manipulate RDDs
- Deal with structured data using Spark SQL through its operations and advanced functions
- Build real-time applications using Spark Structured Streaming
- Develop intelligent applications with the Spark Machine Learning library
Product information
- Title: Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library
- Author(s):
- Release date: August 2018
- Publisher(s): Apress
- ISBN: 9781484235799