14 Optimizing Models and Using AutoML

So far, we've looked at a few machine learning (ML) models for classification and regression: simple linear models (linear regression and logistic regression), k-nearest neighbors (KNN), and Naïve Bayes for classification. As we will see in these next few chapters, there are other models that are commonly used in ML and data science. This chapter will cover how to choose between models and how to optimize models. Specifically, we'll cover:

Hyperparameter optimization with random, grid, and Bayesian searches
Using learning curves to optimize the amount of data needed and diagnose ML models
Optimizing the number of features with recursive feature selection
Using the pycaret AutoML Python package

Let's get ...

Get Practical Data Science with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Practical Data Science with Python by Nathan George

14

Optimizing Models and Using AutoML

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly