
42 | Big Data Simplified
3.3 CONFIGURING A HADOOP CLUSTER
A Hadoop cluster consists of master and slave machines (Linux boxes). The main
configuration files of a Hadoop cluster are ‘hadoop-env.sh’, ‘core-site.xml’,
‘hdfs-site.xml’, ‘mapred-site.xml’ and ‘yarn-site.xml’. The Hadoop package has a
defined file structure, and these files are located in the path
‘$HADOOP_HOME/etc/hadoop’. Here, $HADOOP_HOME is the path of the Hadoop software
package, for example ‘/usr/local/hadoop’.
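As a minimal sketch of the layout described above, the following Python snippet builds the path of each configuration file under ‘$HADOOP_HOME/etc/hadoop’ and reports whether it is present on disk. The helper names `hadoop_conf_dir` and `find_config_files` are hypothetical and used only for illustration. The fallback to ‘/usr/local/hadoop’ when $HADOOP_HOME is unset is an assumption taken from the example path in the text.

```python
import os
from pathlib import Path

# The five main configuration files named in the text.
CONFIG_FILES = [
    "hadoop-env.sh",
    "core-site.xml",
    "hdfs-site.xml",
    "mapred-site.xml",
    "yarn-site.xml",
]


def hadoop_conf_dir(hadoop_home=None):
    """Return the configuration directory, $HADOOP_HOME/etc/hadoop, as a Path."""
    # Assumption: fall back to the example path from the text
    # when no path is given and HADOOP_HOME is not set.
    home = hadoop_home or os.environ.get("HADOOP_HOME", "/usr/local/hadoop")
    return Path(home) / "etc" / "hadoop"


def find_config_files(hadoop_home=None):
    """Map each configuration file name to a (path, exists_on_disk) pair."""
    conf_dir = hadoop_conf_dir(hadoop_home)
    return {
        name: (conf_dir / name, (conf_dir / name).is_file())
        for name in CONFIG_FILES
    }


if __name__ == "__main__":
    for name, (path, present) in find_config_files().items():
        status = "found" if present else "missing"
        print(f"{path}: {status}")
```

Running the script on a machine with Hadoop installed lists the five files with their full paths. On a machine without Hadoop, every file is simply reported as missing.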
A Hadoop cluster can be configured in three modes, as explained below.
• Standalone Mode: This is the default mode of a Hadoop cluster. It is
mainly used for debugging and testing purposes, ...