Chapter 15. How to Group Data: Classification and Clustering

If a machine is expected to be infallible, it cannot also be intelligent.

—Alan Turing, from “Computing Machinery and Intelligence”, 1950

In the previous chapters, you encountered the term classification many times. Classification is one of the most common problems in machine learning, and it can be tackled in various ways, including decision trees, Bayesian classifiers, and even logistic regression.

In this chapter, we’ll present two more sophisticated algorithms for classification, and then we’ll move on to address a subtly similar problem—clustering. According to most dictionaries, classification is the act of arranging a group of things in homogeneous classes based on their characteristics. ...

Get Introducing Machine Learning now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.