O'Reilly logo

Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools by Deepak Vohra

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

© Deepak Vohra 2016

Deepak Vohra, Practical Hadoop Ecosystem, 10.1007/978-1-4842-2199-0_11

11. Apache Mahout

Deepak Vohra

(1)Apt 105, White Rock, British Columbia, Canada

Apache Mahout is a scalable machine learning library with support for several classification, clustering, and collaborative filtering algorithms. Mahout runs on top of Hadoop using the MapReduce model. Mahout also provides a Java API. This chapter explains how to get started with Mahout; you’ll install Mahout and run some sample Mahout applications. You will also see how to develop a user recommender system using the Mahout Java API. This chapter covers the following topics:

  • Setting the environment

  • Configuring and starting HDFS

  • Setting the Mahout environment

  • Running a Mahout classification ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required