Introduction to clustering

Let's get started. In this section, we will contrast unsupervised learning with supervised learning; the latter is what we have been doing up to this point. We will then learn what clustering is.

Supervised and unsupervised learning are two different learning paradigms. So far in this book, we have been performing different kinds of supervised learning. In each case there was a variable, which we called the target variable, that we wanted to predict; this was true for both classification and regression. Learning consisted of finding a model that could accurately predict the target variable. The following diagram represents supervised learning:

With unsupervised learning, there is no target variable. We believe there ...
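
The contrast between the two paradigms can be sketched in a few lines of plain Python. This is only an illustrative sketch, not code from the book: the toy data, the function names (fit_supervised, predict, fit_unsupervised), and the evenly spaced starting centers are all assumptions made for the example. The supervised function receives the target variable y, while the unsupervised one sees only X and has to find the groups on its own, here with a basic k-means style loop.

```python
# Toy 1-D data: two well-separated groups of points (hypothetical values).
X = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9]
y = [0, 0, 0, 1, 1, 1]  # target variable: only the supervised learner sees this


def mean(values):
    return sum(values) / len(values)


def fit_supervised(X, y):
    """Learn one centroid per class, using the known labels."""
    return {label: mean([x for x, t in zip(X, y) if t == label])
            for label in set(y)}


def predict(centroids, x):
    """Return the label whose centroid lies closest to x."""
    return min(centroids, key=lambda label: abs(x - centroids[label]))


def fit_unsupervised(X, k=2, n_iter=10):
    """Group points into k >= 2 clusters without any labels.

    Assumption for this sketch: centers start evenly spaced between
    min(X) and max(X), which keeps the result deterministic.
    """
    lo, hi = min(X), max(X)
    centers = [lo + i * (hi - lo) / (k - 1) for i in range(k)]
    assignments = []
    for _ in range(n_iter):
        # Assign every point to its nearest center.
        assignments = [min(range(k), key=lambda j: abs(x - centers[j]))
                       for x in X]
        # Move each center to the mean of the points assigned to it.
        for j in range(k):
            members = [x for x, a in zip(X, assignments) if a == j]
            if members:
                centers[j] = mean(members)
    return assignments, centers


model = fit_supervised(X, y)           # uses X and y
clusters, centers = fit_unsupervised(X)  # uses X only

print(predict(model, 4.5))
print(clusters)
```

Note that the cluster numbers returned by fit_unsupervised are arbitrary identifiers; nothing ties cluster 0 to class 0 unless labels are brought in afterwards.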
