Skip to Content
PySpark and AWS: Master Big Data with PySpark and AWS
on-demand course

PySpark and AWS: Master Big Data with PySpark and AWS

with AI Sciences
September 2021
Beginner to intermediate
16h 11m
English
Packt Publishing
Closed Captioning available in English

Overview

In this 16-hour course, learn how to harness the power of PySpark and AWS for big data projects. You'll cover everything from foundational concepts to advanced techniques, explore Spark's architecture, and implement data processing and machine learning workflows, all while utilizing the AWS cloud.

What I will be able to do after this course

  • Understand the Spark and Hadoop architectures and ecosystems.
  • Work with Spark RDDs and Dataframes for effective data processing.
  • Integrate PySpark workflows into AWS services for cloud-based solutions.
  • Build machine learning pipelines using PySpark for real-world applications.
  • Develop ETL and data engineering solutions using PySpark on big data.

Course Instructor(s)

Led by the experienced instructors at AI Sciences, this course aims to provide practical, hands-on programming expertise. The instructors combine academic knowledge with industry experience in data engineering to deliver a focused, project-based learning experience in big data technologies.

Who is it for?

This course is ideal for software engineers, data scientists, and aspiring big data engineers with prior experience in Python programming. Learners interested in gaining cloud computing expertise and understanding real-world applications of PySpark would greatly benefit from this course.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Amazon Web Services AWS LiveLessons 2nd Edition

Amazon Web Services AWS LiveLessons 2nd Edition

Richard Jones
Serverless ETL and Analytics with AWS Glue

Serverless ETL and Analytics with AWS Glue

Vishal Pathak, Subramanya Vajiraya, Noritaka Sekiyama, Tomohiro Tanaka, Albert Quiroga, Ishan Gaur

Publisher Resources

ISBN: 9781803236698Supplemental Content