Let's have a look at Apache Spark architecture, including a high level overview and a brief description of some of the key software components.
High level overview
At the high level, Apache Spark application architecture consists of the following key software components and it is important to understand each one of them to get to grips with the intricacies of the framework:
- Driver program
- Master node
- Worker node
- SQL context
- Spark session
Here's an overview of how some of these software components fit together within the overall architecture:
Figure 1.15: Apache Spark application architecture - Standalone mode ...