Chapter 7. Spring with Hadoop

Processing large chunks of data has been a major challenge in architecting modern day web applications. Hadoop is an open source framework from Apache that provides libraries to process and store large chunks of data. It offers a scalable, cost-effective, and fault-tolerant solution to store and process large chunks of data. In this chapter, let us demonstrate how the Spring Framework supports Hadoop. Map and Reduce, Hive, and HDFS are some of the Hadoop key terminology used with cloud-based technologies. Google has also come with its own Map and Reduce and distributed file system framework, apart from Apache Hadoop.

Apache Hadoop modules

Apache Hadoop consists of the following modules:

Hadoop Common: This is a common ...

Get Mastering Spring Application Development now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Mastering Spring Application Development by Anjana Mankale

Chapter 7. Spring with Hadoop

Apache Hadoop modules

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly