Joe Celko’s Complete Guide to NoSQL

by Joe Celko
October 2013
Content level: Beginner to intermediate
244 pages
5h 53m
English
Morgan Kaufmann
Chapter 4

MapReduce Model

Abstract

The MapReduce model originated at Google, which developed it for internal use on top of its own distributed file system. Hadoop and the Hadoop distributed file system (HDFS) are an open-source implementation of that design, and Yahoo, a major Hadoop contributor, developed Pig Latin to handle its volume of data. Both Hadoop and Pig are open source. Hadoop dominates the NoSQL market as part of the SMAQ stack (storage, MapReduce, and query), the NoSQL counterpart of the LAMP stack for websites. The process has two phases: mapping and reducing. The mapping phase reads and processes the input data in parallel, emitting intermediate key-value pairs. The reduce phase filters and aggregates those pairs to produce a final result.
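
The two phases can be illustrated with a minimal single-process sketch of a word count. This is not Hadoop code and not taken from the book: the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical, and the shuffle step only stands in for the grouping a real framework performs between the phases across many machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map: emit one (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group the emitted values by key (hypothetical stand-in for the
    grouping a framework does between the map and reduce phases)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: aggregate all values collected for a single key."""
    return key, sum(values)


def word_count(documents):
    # Each document could be mapped on a different node; here the
    # mappers simply run one after another in the same process.
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())
```

Calling word_count on a list of strings returns a dictionary that maps each word to its total count. In a distributed setting the same map and reduce functions would be applied to separate input splits and separate key groups in parallel.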

Keywords

ETL (extract transform load); Google; Hadoop; HDFS (Hadoop distributed file system); LAMP stack; MapReduce; Pig Latin; RAID storage systems; SMAQ stack; Yahoo

Introduction

This chapter discusses ...

Publisher Resources

ISBN: 9780124071926