O'Reilly logo

Apache Mahout Essentials by Jayani Withanawasam

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Summary

In this chapter we learned that classification and regression are supervised learning problems, which require labeled data. Predictor variables and output variables should be defined to come up with a model during the training phase.

We also saw that supervised learning can be achieved using different techniques, namely model-based, regression-based and tree-based techniques.

Regression can be divided into two categories based on the outcome of the algorithm, that is linear regression and logistic regression.

We saw that text classification is an important application and this is explained using the Naïve Bayes algorithm.

In the next chapter, we will discuss the recommendation techniques using Apache Mahout.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required