July 2017
Metadata checkpointing saves the information that defines the streaming operations, which are represented by a Directed Acyclic Graph (DAG), to HDFS. This information can be used to recover the DAG if there is a failure and the application is restarted: the driver restarts, reads the metadata from HDFS, rebuilds the DAG, and recovers the operational state that existed before the crash.
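The recovery flow described above can be sketched with Spark Streaming's `StreamingContext.getOrCreate`, which rebuilds the context from checkpoint data when the directory already holds some and otherwise calls a factory function to define the DAG from scratch. This is a minimal sketch, not a definitive implementation: the checkpoint path, the socket source on `localhost:9999`, the 10-second batch interval, and the names `MetadataCheckpointSketch` and `createContext` are all assumptions chosen for illustration, and the job is assumed to be launched with `spark-submit` so that the master is supplied externally.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

object MetadataCheckpointSketch {
  // Hypothetical checkpoint location; replace with a directory on your cluster.
  val checkpointDir = "hdfs:///tmp/streaming-checkpoint"

  // Builds a fresh context and defines the streaming DAG.
  // getOrCreate calls this only when no usable checkpoint exists.
  def createContext(): StreamingContext = {
    // The master URL is assumed to come from spark-submit.
    val conf = new SparkConf().setAppName("MetadataCheckpointSketch")
    val ssc = new StreamingContext(conf, Seconds(10)) // assumed batch interval
    ssc.checkpoint(checkpointDir) // enable checkpointing to this directory

    // Hypothetical source: text lines read from a socket.
    val lines = ssc.socketTextStream("localhost", 9999)
    lines
      .flatMap(_.split(" "))
      .map(word => (word, 1))
      .reduceByKey(_ + _)
      .print()

    ssc
  }

  def main(args: Array[String]): Unit = {
    // On a restart after a failure, the context and its DAG are
    // reconstructed from the checkpoint instead of calling createContext.
    val ssc = StreamingContext.getOrCreate(checkpointDir, createContext _)
    ssc.start()
    ssc.awaitTermination()
  }
}
```

All DStream operations are defined inside the factory function because a context restored from a checkpoint already carries its DAG; setting up streams outside that function would not be part of the recovered graph.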
Metadata includes the following: