Skip to Main Content
Big Data Simplified
book

Big Data Simplified

by Sayan Goswami, Amit Kumar Das, Sourabh Mukherjee
June 2019
Beginner to intermediate content levelBeginner to intermediate
360 pages
10h 55m
English
Pearson Education India
Content preview from Big Data Simplified
54 | Big Data Simplied
will then look up in its internal table of contents to see where the first block (Block 1) of
File1 is stored. In this case, it happens to be DataNode 1.
The NameNode then forwards this request to the DataNode 1 to read the actual contents from
Block 1, and it is this content that is returned back to the client (Figure 3.6).
For example, three files are in HDFS with the size of a.txt (256 MB), b.txt (289 MB) and c.txt
(370 MB). Thus, HDFS will allocate a total of 8 blocks (default size of a block 128 MB) for these
three files. Here, a.txt will consume 2 blocks, b.txt and c.txt will absorb 3 blocks, respectively.
3.4.3 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Big Data

Big Data

James Warren, Nathan Marz
Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications

Gary D. Miner, John Elder, Andrew Fast, Thomas Hill, Robert Nisbet, Dursun Delen
Data Wrangling with Python

Data Wrangling with Python

Jacqueline Kazil, Katharine Jarmul

Publisher Resources

ISBN: 9789353941505