3

Spark Architecture and Transformations

Spark approaches data processing differently than traditional tools and technologies. To understand Spark’s unique approach, we will have to understand its basic architecture. A deep dive into Spark’s architecture and its components will give you an idea of how Spark achieves its ground-breaking processing speeds for big data analytics.

In this chapter, you will learn about the following broader topics:

  • Spark architecture and execution hierarchy
  • Different Spark components
  • The roles of the Spark driver and Spark executor
  • Different deployment modes in Spark
  • Transformations and actions as Spark operations

By the end of this chapter, you will have valuable insights into Spark’s inner workings and know how ...

Get Databricks Certified Associate Developer for Apache Spark Using Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.