Joe Celko’s Complete Guide to NoSQL by Joe Celko

Chapter 4

MapReduce Model

Abstract

The MapReduce model was developed by Google and Yahoo for their internal use. Google built its MapReduce framework on top of the Google File System. Hadoop and its distributed file system (HDFS), the open-source counterpart, were developed largely at Yahoo, which also created Pig Latin to handle its volume of data. Hadoop and Pig became open source. Hadoop dominates the NoSQL market as part of the SMAQ (storage, MapReduce, and query) stack, the NoSQL counterpart of the LAMP stack for websites. The process has two phases: mapping and reducing. The mapping phase reads the data in a parallelized fashion and emits intermediate key-value pairs. The reduce phase filters and aggregates this intermediate data to produce a final result.

Keywords

ETL (extract transform load); Google; Hadoop; HDFS (Hadoop distributed file system); LAMP stack; MapReduce; Pig Latin; RAID storage systems; SMAQ stack; Yahoo

Introduction

This chapter discusses ...
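
The two phases named in the abstract can be illustrated with a small word-count sketch. This is not Hadoop code and not the author's example. It is a minimal single-machine illustration in plain Python, and the names map_phase, shuffle, reduce_phase, and map_reduce are hypothetical helpers chosen for this sketch. A thread pool stands in for the cluster nodes that would run the map tasks in parallel.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def map_phase(chunk):
    """Map: turn one chunk of input into (key, value) pairs."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(mapped):
    """Group all intermediate values by key (done by the framework in a real system)."""
    groups = defaultdict(list)
    for pairs in mapped:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: aggregate every value seen for one key."""
    return key, sum(values)


def map_reduce(chunks):
    # Each chunk is mapped independently, so the map tasks can run in parallel.
    with ThreadPoolExecutor() as pool:
        mapped = list(pool.map(map_phase, chunks))
    groups = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in groups.items())


counts = map_reduce(["the map phase", "the reduce phase"])
print(counts)
```

Because each chunk is mapped without reference to any other chunk, the map step scales by adding workers. Only the grouping step needs to see all of the intermediate pairs.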
