20. Optimizing Spark Applications

This chapter covers the following:

Image Understanding the Spark execution model

Image Shuffle operations and how to minimize them

Image Selecting appropriate operators

Image Partitioning and parallelism

Image Understanding Spark’s query optimizer ...

Get Expert Hadoop® Administration now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.