March 2022
Beginner to intermediate
456 pages
13h
English
1.1 What is PySpark?
Taking it from the start: What is Spark?
1.2 Your very own factory: How PySpark works
Some physical planning with the cluster manager
A factory made efficient through a lazy leader
1.3 What will you learn in this book?
1.4 What do I need to get started?
Part 1. Get acquainted: First steps in PySpark
2 Your first data program in PySpark
2.1 Setting up the PySpark shell
Configuring how chatty Spark is: The log level
2.3 Ingest and explore: Setting the stage for data transformation