Video description
Hadoop and Spark are the stars of the big data world. This course covers the basics of Spark and how to use Spark and Hadoop together for big data analytics. Designed for developers, architects, and data analysts with a fundamental understanding of Hadoop, it begins with an overview of how Hadoop and Spark are used in today's big data ecosystem before moving into hands-on labs that demonstrate Spark and Spark-Hadoop integration.
You'll learn about the Spark shell, RDDs, and DataFrames; how to query data in Hadoop Hive Tables from Spark; and how to develop Spark applications and run them on YARN.
- Discover how to integrate the Hadoop and Spark big data analytics platforms
- Get access to 11 hands-on labs demonstrating the core aspects of Hadoop-Spark integration
- Learn the basics of the Spark framework: Spark shell, RDDs, and DataFrames
- Explore methods for analyzing data in Hadoop HDFS and Hive using Spark
- Gain an understanding of how to write Spark applications and run them on YARN
Table of contents
- Introduction
  - Course Intro And What To Expect 00:01:29
  - About The Author 00:00:39
- Getting Started
  - Big Data Eco System Overview 00:05:09
  - What Is Spark 00:04:49
  - Spark Vs. Hadoop 00:06:41
  - Setting Up The Environment 00:06:10
  - Setting Up Data In Hadoop Exercise Lab 00:11:40
- Spark
  - Spark And Spark Shell Overview 00:03:32
  - Spark Shell Labs 00:07:10
  - RDD Overview 00:08:47
  - RDD Labs 00:06:00
  - DataFrames 00:06:25
  - DataFrames Lab Part 1 00:08:03
  - DataFrames Lab Part 2 00:03:47
- Spark And Hive
  - Hive Lab Part 1 00:03:09
  - Hive Lab Part 2 00:03:44
- Spark YARN
  - Spark And YARN Lab 00:05:15
  - Spark Applications 00:03:53
  - Spark Applications Lab 1 Part 1 00:05:39
  - Spark Applications Lab 1 Part 2 00:02:41
  - Spark Applications Lab 2 00:05:44
- Conclusion
  - Wrap Up And Thank You 00:02:07
Product information
- Title: Data Analytics Using Spark and Hadoop
- Author(s):
- Release date: October 2016
- Publisher(s): Infinite Skills
- ISBN: 9781491963159