Apache Mahout Essentials by Jayani Withanawasam

Chapter 5. Apache Mahout in Production

This chapter explains how to achieve scalability in Apache Mahout by running it on the Apache Hadoop ecosystem.

In this chapter, we will cover the following topics:

  • Key components of Apache Hadoop
  • The life cycle of a Hadoop application
  • Setting up Hadoop
    • Local mode
    • The pseudo-distributed mode
    • The fully-distributed mode
  • Setting up Apache Mahout with Hadoop
  • Monitoring Hadoop
  • Troubleshooting Hadoop
  • Optimization tips

Introduction

So far, we have discussed key machine learning techniques, such as clustering, classification, and recommendations. However, several other machine learning tools and libraries, such as MATLAB, R, and Weka, can also be used to implement these techniques.

The volume of available information is growing at an alarming rate. ...
