The Hadoop jungle explained
In this section, we briefly explain the components of Hadoop and point you to related study materials. Many of these components are referred to throughout the book.
Big data tamed
We know that big data is everywhere and that it needs to be processed and analyzed into something meaningful. But how do we process the data without breaking the bank?
Hadoop, the hero, can tame this big data monster because of the following features:
- It is enterprise grade but runs on commodity servers. Storing big data on traditional database storage is very expensive, generally in the order of $25,000 to $50,000 per terabyte per year. However, with Hadoop, the cost of storing data on commodity servers drops by 90 percent to the order ...