Chapter 15. How to Group Data: Classification and Clustering

If a machine is expected to be infallible, it cannot also be intelligent.

—Alan Turing, from “Computing Machinery and Intelligence”, 1950

In the previous chapters, you encountered the term classification many times. Classification is one of the most common problems in machine learning, and it can be tackled in various ways, including decision trees, Bayesian classifiers, and even logistic regression.

In this chapter, we’ll present two more sophisticated algorithms for classification, and then we’ll move on to address a subtly similar problem—clustering. According to most dictionaries, classification is the act of arranging a group of things in homogeneous classes based on their characteristics. ...

Get Introducing Machine Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.