September 2017
Beginner to intermediate
412 pages
8h 55m
English
Apache Hadoop is an open-source software framework for the distributed storage and processing of very large datasets. It implements the MapReduce programming model.
The system includes these modules:
Hadoop grew out of ideas described in Google's 2003 paper on the Google File System. Its co-creator, Doug Cutting, named it after his son's toy elephant. By 2006, its file system had become HDFS, the ...