
Apache Mahout Essentials by Jayani Withanawasam


Apache Mahout with Hadoop

Apache Mahout achieves scalability by running on Apache Hadoop, a distributed computing framework. The following figure shows where Apache Hadoop fits into Apache Mahout:

[Figure: Apache Mahout with Hadoop]

As shown in the previous figure, YARN (data processing) and HDFS (data storage) are the key components of Apache Hadoop.

In this chapter, we will explain the important subcomponents of Yet Another Resource Negotiator (YARN) and the Hadoop Distributed File System (HDFS), along with their behavior, in detail before proceeding to the Hadoop installation steps.
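To make the split between data storage and data processing more concrete, here is a toy sketch in plain Python. It does not use any Hadoop or Mahout API. The function names, the tiny block size, and the word-count task are all assumptions chosen for illustration: the input is cut into blocks (loosely standing in for the storage layer), each block is counted independently (a map step), and the partial counts are merged (a reduce step).

```python
from collections import Counter
from functools import reduce

# Hypothetical tiny block size in characters; real HDFS blocks are much larger.
BLOCK_SIZE = 16


def split_into_blocks(text, block_size=BLOCK_SIZE):
    """Storage stand-in: cut the input into blocks of whole words."""
    blocks, current, length = [], [], 0
    for word in text.split():
        # Start a new block when adding this word would exceed the block size.
        if current and length + len(word) + 1 > block_size:
            blocks.append(" ".join(current))
            current, length = [], 0
        current.append(word)
        length += len(word) + 1
    if current:
        blocks.append(" ".join(current))
    return blocks


def map_block(block):
    """Map step: count words in one block, independently of the others."""
    return Counter(block.split())


def reduce_counts(partials):
    """Reduce step: merge the per-block counts into one total."""
    return reduce(lambda a, b: a + b, partials, Counter())


def word_count(text):
    """Run the toy pipeline: split into blocks, map each block, reduce."""
    blocks = split_into_blocks(text)
    partials = [map_block(b) for b in blocks]
    return reduce_counts(partials)


if __name__ == "__main__":
    print(word_count("mahout hadoop yarn hdfs mahout hadoop mahout"))
```

Because each block is processed without reference to the others, the map step could in principle run on separate machines. In this sketch everything runs in a single process; the sketch does not attempt to model how a real cluster distributes that work.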


YARN with MapReduce 2.0

First, ...
