Fault-tolerance components and mechanism in Apex

In Chapter 2, Getting Started with Application Development, we looked at the deployment of an Apex application when it is executing on a YARN cluster. Let's revisit the diagram to see which type of failures may occur and how they are handled by the system:

The client is only required for launching the application; it is not involved in the execution of the DAG on the cluster, and failure of the client node does not affect the pipeline. Since Apex is running on YARN, let's first see how YARN supports resilient applications (from a user's perspective).

YARN consists of a resource manager (RM) ...

Get Learning Apache Apex now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.