HDFS logical architecture

We'll now gain an understanding of some of the design decisions of HDFS and how they mitigate some of the bottlenecks associated with a large dataset's storage and processing in a distributed manner. It's time to take a deep dive into the HDFS architecture. The following diagram represents the logical components of HDFS:

For simplicity's sake, you can divide the architecture into two groups. One group can be called the data group. It consists of processes/components that are related to file storage. The other group can be called the management group. It consists of processes/components that are used to manage data ...

Get Mastering Hadoop 3 now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.