14

Optimizing Models and Using AutoML

So far, we've looked at a few machine learning (ML) models for classification and regression: simple linear models (linear regression and logistic regression), k-nearest neighbors (KNN), and Naïve Bayes for classification. As we will see in these next few chapters, there are other models that are commonly used in ML and data science. This chapter will cover how to choose between models and how to optimize models. Specifically, we'll cover:

  • Hyperparameter optimization with random, grid, and Bayesian searches
  • Using learning curves to optimize the amount of data needed and diagnose ML models
  • Optimizing the number of features with recursive feature selection
  • Using the pycaret AutoML Python package

Let's get ...

Get Practical Data Science with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.