HDFS to manage all the machines and storage spaces in the cluster. It does that by setting up a
master-slave configuration and it will be discussed further as follows.
Within the cluster, one of the machines is designated as the master node. The master node
is responsible for coordinating the storage across all other nodes on the cluster, which are slave
nodes. On this master node, HDFS runs a process, which receives all requests that are made to
the cluster, and forwards it to the slave nodes which contain the data. Let us examine this process
in detail. The master node is called the NameNode. All other machines in the ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month, and much more.