Video description
This video series highlights what's new in Apache 2.0 and reviews its core concepts. The course starts with a high-level overview of Spark's components and then dives into Spark 2.0's three main themes: simplicity, speed, and intelligence.
The simplicity section describes how Spark 2.0 unifies the Spark APIs and Spark session, and how Spark 2.0 simplifies machine learning via ML pipelines. The speed section illustrates how Spark 2.0 improves Spark performance with the push toward whole-stage code generation. And the intelligence section provides a quick primer on Spark Streaming and an introduction to the concepts of Structured Streaming. The course is designed for data scientists and data engineers with some basic experience using machine learning tools such as Python scikit-learn.
- Understand the key features of Spark 2.0 that make building production pipelines easier than ever
- Learn to solve data analytics problems by performing ad hoc analysis using Spark SQL
- Gain experience in building machine learning solutions using Spark ML pipelines
- Start prototyping with Structured Streaming to build continuous applications
- Understand the paradigm shift and benefits of Datasets and DataFrames
- Learn how Datasets and DataFrames are used for Spark SQL, machine learning, and streaming
Publisher resources
Table of contents
-
Introduction
- Welcome To The Course 00:01:32
- About The Author 00:01:32
-
Introducing Apache Spark 2.0
- What Is Apache Spark 00:07:40
- Getting Started With Apache Spark 00:03:03
- Spark Jobs And APIs 00:06:23
-
Spark 2.0 Simplicity: Unifying Datasets And Dataframes
- Unified API And Spark Session 00:06:38
- Spark MLlib - A Primer On ML Pipelines 00:07:58
- Spark 2.0 Speed: Tungsten Phase 2
-
Spark 2.0 Intelligence: Structured Streaming
- Quick Refresh Of Spark Streaming 00:07:08
- Introducing Structured Streaming 00:04:40
-
Conclusion
- Wrap Up And Thank You 00:01:02
Product information
- Title: Introduction to Apache Spark 2.0
- Author(s):
- Release date: July 2017
- Publisher(s): Infinite Skills
- ISBN: 9781491991220
You might also like
video
Python Fundamentals
51+ hours of video instruction. Overview The professional programmer’s Deitel® video guide to Python development with …
video
Data Engineering with Python and AWS Lambda LiveLessons
7 Hours of Video Instruction Data Engineering with Python and AWS Lambda LiveLessons shows users how …
video
Amazon Web Services AWS LiveLessons 2nd Edition
More Than 17 Hours of Video Instruction More than 17 hours of video instruction on Amazon …
video
Microsoft AZ-900 Certification Course: Azure Fundamentals
Not sure where to start with the Microsoft Azure platform? Whether an IT pro or new …