1. Background and Concepts

In This Chapter:

The Apache Hadoop project is introduced along with a working definition of Big Data.

The concept of a Hadoop data lake is developed and contrasted with traditional data storage methods.

A basic overview of the Hadoop MapReduce process is presented.

The evolution of Hadoop version 1 (V1) to Hadoop version ...

