November 2017
Intermediate to advanced
670 pages
17h 35m
English
This layer can store a large amount of data. Output is typically stored in a read-only database. Any errors can be fixed by recomputing based on a complete data set at which time views can be updated. Apache Hadoop is the de facto standard batch-processing system used in most high-throughput architectures. Response times can be measured in minutes or even hours.