12 Evaluating Machine Learning Classification Models and Sampling for Classification

Once we have trained some classification models to predict our target variable, we need a way to compare them and choose the best one. One way to compare models is with performance metrics such as accuracy. In classification, we often find that our classes or targets are imbalanced, meaning that some classes have far fewer samples than others. Sampling techniques such as oversampling and undersampling can often improve the performance of ML classification algorithms on such data. In this chapter, we will learn about ways to evaluate our classification models and about sampling methods:

How to evaluate the performance of our algorithms (performance metrics)
Sampling imbalanced data for classification
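
To make these two topics concrete before we dive in, here is a minimal sketch in plain Python. It is not the chapter's own code: the toy labels, the always-predict-the-majority "model", and the helper names `accuracy` and `random_oversample` are all hypothetical, chosen only to illustrate why accuracy alone can mislead on imbalanced data and what simple random oversampling does to the class counts.

```python
import random
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def random_oversample(X, y, seed=0):
    """Duplicate randomly chosen rows of each smaller class
    until every class has as many rows as the largest class."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, count in counts.items():
        rows = [x for x, lbl in zip(X, y) if lbl == label]
        extra = [rng.choice(rows) for _ in range(target - count)]
        X_out.extend(extra)
        y_out.extend([label] * len(extra))
    return X_out, y_out


# Hypothetical toy data: 95 negatives (0) and 5 positives (1).
y_true = [0] * 95 + [1] * 5
X = [[i] for i in range(100)]

# A "model" that always predicts the majority class (assumption for illustration).
y_pred = [0] * 100
baseline_accuracy = accuracy(y_true, y_pred)

# Balance the classes by oversampling the minority class.
X_balanced, y_balanced = random_oversample(X, y_true)
balanced_counts = Counter(y_balanced)

print(baseline_accuracy)
print(balanced_counts)
```

The always-negative model scores high accuracy here even though it never finds a single positive sample, which is why we will want more than one metric. After oversampling, both classes have the same number of rows; in practice, resampling like this would normally be applied to the training data only.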

Let's start with metrics ...


Practical Data Science with Python by Nathan George

