October 2018
Beginner
220 pages
5h 33m
English
HDFS offers unlimited storage with scalability, so it can be used as an archival storage system. The following Data Flow Diagram (DFD) depicts the pattern of HDFS as an archive store:

All of the sources supply data in real time to the Primary Database, which provides faster access. This data, once it is stored and utilized, is periodically moved to archival storage in HDFS for data recovery and change logging. HDFS can also process this data and provide analytics over time, whereas the primary database continues to serve the requests that demand real time data.
The pros of this pattern are as follows:
Read now
Unlock full access