Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture by George J. Trujillo Jr., Justin Murray, Rommel Garcia, Steven Jones, Charles Kim

Chapter 2. Hadoop Fundamental Concepts

Without big data analytics, companies are blind and deaf, wandering out onto the Web like deer on a freeway.

—Geoffrey Moore

This chapter introduces the main components of a Hadoop cluster. Someone new to Hadoop needs an overall understanding of what Hadoop is and what its major components are before looking at it in detail. We also introduce a number of Hadoop distributions. By the end of this chapter, you will understand the main Hadoop software processes and Hadoop hardware profiles. We finish the chapter with the different roles needed in a Hadoop environment.

Types of Data in Hadoop

Hadoop can store data of different types from lots of different sources. Let’s start by taking ...
