Client mode
In the client mode, the driver process runs on the client node (that is, the edge or gateway node) on which the job was submitted. The client node provides resources, such as memory, CPU, and disk space to the driver program, but the executors run on the cluster nodes and they are maintained by the cluster manager, such as YARN. Earlier, we saw how we used client mode to submit our sales application in the previous section. One of the advantages of running your job in client mode is that you can easily access your logs on the same machine. But when your number of Spark applications increase in production, you should not consider client mode for job execution. This is because the client node has limited resources. If some of the ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access