Overview
In this 3-hour course, you'll learn the fundamentals of Apache Spark and Scala through hands-on big data examples, and gain practical skills in building and optimizing Spark applications with Scala.
What I will be able to do after this course
- Understand Apache Spark architecture and its processing model.
- Develop applications using Spark RDDs and Spark SQL.
- Optimize Spark jobs through caching and partitioning.
- Develop and scale Apache Spark applications on a Hadoop YARN cluster.
- Analyze datasets efficiently using Spark SQL, DataFrames, and Datasets.
Course Instructor(s)
James Lee is an experienced big data engineer and educator with expertise in Apache Spark and Scala. With years of professional experience and a knack for clear and concise teaching, he breaks down complex topics into manageable lessons.
Who is it for?
This course is designed for software developers and data scientists who have basic programming experience and want to deepen their understanding of big data processing with Apache Spark. It is well suited to anyone pursuing a career in big data development and engineering.