It's time to take a deep dive into the HDFS architecture. We'll look at some of the design decisions behind HDFS and how they mitigate the bottlenecks associated with storing and processing large datasets in a distributed manner. The following diagram represents the logical components of HDFS:
For simplicity's sake, you can divide the architecture into two groups. The first, the data group, consists of the processes and components related to file storage. The second, the management group, consists of the processes and components used to manage data ...