Overview
In this 6-hour course, you will gain a foundational understanding of Spark programming in Scala while exploring the architecture of Apache Spark 3. You will learn practical data engineering skills through examples, focusing on core Spark features such as the Structured API, DataFrames, and Datasets.
What you will be able to do after this course
- Understand Apache Spark's architecture and its programming model.
- Learn to process data using Spark's Structured API, transformations, and aggregations.
- Work with data sources and sinks, and perform advanced joins using DataFrames.
- Use IntelliJ IDEA to develop, debug, and deploy Spark applications.
- Develop practical skills in unit testing and managing logs for Spark applications.
Course Instructor(s)
ScholarNest is a proficient instructor with a strong background in data engineering and extensive experience teaching technical topics. Their approach combines theory with practical coding sessions to make sure concepts are well understood. They aim to equip you with the skills needed for real-world Spark projects.
Who is it for?
This course is ideal for software engineers who want to build data engineering pipelines with Spark, data architects and engineers who manage data infrastructure, and managers who want to understand Spark's potential. Prior experience with Scala programming is recommended.