Chapter 2. Hadoop Fundamental Concepts

Without big data analytics, companies are blind and deaf, wandering out onto the Web like deer on a freeway.

—Geoffrey Moore

The goal of this chapter is to introduce the main components of a Hadoop cluster. It is important that someone new to Hadoop get an overall understanding of what Hadoop is and its major components before looking at Hadoop in detail. We introduce a number of Hadoop distributions. At the end of this chapter you will understand the main Hadoop software processes and Hadoop hardware profiles. We finish the chapter with the different roles needed in a Hadoop environment.

Types of Data in Hadoop

Hadoop can store data of different types from lots of different sources. Let’s start by taking ...

Get Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.