Chapter 5. Learning from Data

In this chapter, we will cover the following recipes:

  • Predicting continuous values using linear regression
  • Binary classification using LogisticRegression and SVM
  • Binary classification using LogisticRegression with the Pipeline API
  • Clustering using K-means
  • Feature reduction using principal component analysis

Introduction

In previous chapters, we saw how to load, prepare, and visualize data. Now, let's start doing some interesting stuff with it. In this chapter, we'll be looking into applying various machine learning techniques on top of it. We'll look at a few examples for the two broad classifications of machine learning techniques: supervised and unsupervised learning. Before that, however, let's briefly see what these ...

Get Scala Data Analysis Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.