July 2017
Beginner to intermediate
378 pages
10h 26m
English
Hadoop MapReduce is a core component of the Hadoop framework. It is a software framework for processing data in parallel across the nodes of a cluster. The topic can get confusing because people tend to refer to the concept of MapReduce and the Hadoop component of the same name interchangeably, as if they were one and the same.
The concept of Map and Reduce will be covered in the Spark section later on in this chapter. The concept is not tied to any specific framework or project. You may read articles that imply that MapReduce is going away, to be replaced by something else. It is often not clearly spelled out, but such articles are referring to the Hadoop component and not to the concept.
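To make the distinction concrete, here is a minimal sketch of the Map and Reduce concept in plain Python, with no Hadoop or Spark involved. It is an illustration only, not how Hadoop MapReduce is implemented. The function names (`map_phase`, `reduce_phase`, `word_count`) and the sample input are hypothetical and chosen purely for this example.

```python
from functools import reduce


def map_phase(line):
    """Map step: turn one line of text into a list of (word, 1) pairs."""
    return [(word.lower(), 1) for word in line.split()]


def reduce_phase(counts, pair):
    """Reduce step: fold one (word, n) pair into the running totals."""
    word, n = pair
    counts[word] = counts.get(word, 0) + n
    return counts


def word_count(lines):
    """Apply the map step to every line, then reduce all pairs to totals."""
    mapped = map(map_phase, lines)                      # one list of pairs per line
    pairs = (pair for chunk in mapped for pair in chunk)  # flatten into one stream
    return reduce(reduce_phase, pairs, {})


if __name__ == "__main__":
    # Hypothetical sample input, used only for illustration.
    sample = [
        "Hadoop runs MapReduce jobs",
        "Spark also runs map and reduce steps",
    ]
    print(word_count(sample))
```

Everything here runs in a single process on one machine. A framework such as the Hadoop component adds the distribution of the same two steps across the nodes of a cluster, which is why the concept can outlive any particular implementation.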
The concept of Map and Reduce is not ...