Cluster mode
Cluster mode is similar to client mode, except that the driver process runs on one of the cluster worker machines, and the cluster manager is responsible for both driver and executor processes. This gives an advantage of running multiple applications at the same time because cluster manager will distribute the driver load across the cluster. This mode is the most common and recommended mode for running the Spark applications. In this mode, the logs can be collected from the cluster manager or you can implement a central logging solution to gather the application logs.
The following diagram shows our sale application running in cluster mode. Driver process is running on the first worker node and all executors are running ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access