
319Open Source Support for Cloud
The Apache Hadoop framework is composed of the following modules:
• Hadoop common: It contains libraries and utilities needed by other
Hadoop modules.
• Hadoop distributed le system (HDFS): It is a distributed le system
that stores data on the commodity machines, providing very high-
aggregate bandwidth across the cluster.
• Hadoop YARN: It is a resource management platform responsible for
managing compute resources in clusters and using them for sched-
uling of users’ applications.
• Hadoop MapReduce: It is a programming model for large-scale data
processing.
All the modules in Hadoop are designed with a funda ...