High Performance Spark, 2nd Edition

by Holden Karau, Adi Polak, Rachel Warren
May 2026
Intermediate to advanced
350 pages
2h 50m
English
O'Reilly Media, Inc.
Content preview from High Performance Spark, 2nd Edition

Brief Table of Contents (Not Yet Final)

Ch 1: Intro to High Performance Spark (available)

Ch 2: How Spark Works (not available)

Ch 3: Upgrading Spark (available)

Ch 4: What’s new in Apache Spark 3.3 (not available)

Ch 5: Dataframes, Datasets, & SparkSQL (not available)

Ch 6: Joins (SQL and core) (available)

Ch 7: Effective transformations (not available)

Ch 8: Working with Key/Value data (not available)

Ch 9: Going Beyond Scala (available)

Ch 10: Generative AI / DL & Multi-Tool Pipelines (not available)

Ch 11: Testing and validation (available)

Ch 12: Spark components and packages (available)

Ch 13: Spark Streaming (not available)

Ch 14: Debugging, tuning and other things developers like to pretend don’t exist (not available)

Publisher Resources

ISBN: 9781098145842