Without big data analytics, companies are blind and deaf, wandering out onto the Web like deer on a freeway.
The goal of this chapter is to introduce the main components of a Hadoop cluster. It is important that someone new to Hadoop get an overall understanding of what Hadoop is and its major components before looking at Hadoop in detail. We introduce a number of Hadoop distributions. At the end of this chapter you will understand the main Hadoop software processes and Hadoop hardware profiles. We finish the chapter with the different roles needed in a Hadoop environment.
Hadoop can store data of different types from lots of different sources. Let’s start by taking ...