Anomaly Detection – Finding Outliers in Data

Detecting anomalies in data is a recurring theme in machine learning. In Chapter 10, Imbalanced Learning – Not Even 1% Win the Lottery, we learned how to spot these interesting minorities in our data. Back then, the data was labeled, so the classification algorithms from the previous chapters were a good fit for the problem. Not every anomaly detection problem comes with labels, however; in many cases, the data is unlabeled.

In this chapter, we are going to learn how to identify outliers in our data, even when no labels are provided. We will use three different algorithms and learn about the two branches of unlabeled anomaly detection. The following topics will be covered in this chapter:

  • Unlabeled anomaly ...

