O'Reilly logo

Apache Mahout Essentials by Jayani Withanawasam

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 3. Regression and Classification

This chapter explains the regression and classification technique in machine learning and its implementation using different machine learning algorithms in Apache Mahout. The machine learning theory behind the algorithm and real-world applications with example scripts are also explained.

In this chapter, we will cover the following topics:

  • Supervised learning
  • Target variables and predictor variables
  • Predictive analytics techniques
  • Classification versus regression
  • Linear regression with Apache Spark
  • Logistic regression with Stochastic Gradient Descent (SGD)
  • Naïve Bayes algorithm
  • Hidden Markov Models (HMMs)

Supervised learning

Supervised learning is a machine learning technique that requires labeled training data.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required