Chapter 18

Performing Cross-Validation, Selection, and Optimization

IN THIS CHAPTER

Learning about overfitting and underfitting

Choosing the right metric to monitor

Cross-validating the results

Selecting the best features for your model

Optimizing hyperparameters

This chapter is about how machine learning algorithms learn, and it explores some methods for making them learn better. Machine learning algorithms can indeed learn from data. For instance, the four algorithms presented in the previous chapter, although not complex, can effectively estimate a class or a value after being presented with examples associated with outcomes. It is all a matter of learning by induction, which is the process of extracting general rules from specific examples. From childhood, humans commonly learn by seeing examples, deriving some general rules or ideas from them, and then successfully applying the derived rule to new situations as we grow up. For example, if we see someone being burned after touching ...

Get Python for Data Science For Dummies, 3rd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Python for Data Science For Dummies, 3rd Edition by John Paul Mueller, Luca Massaron

Performing Cross-Validation, Selection, and Optimization

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly