9

 

 

 

EVALUATION

 

If you’re familiar with the mechanics of machine learning, you might have noticed the absence of training and test data in the models used in the exercises thus far. An explanation for this vital question will be revealed later in this chapter, but, first, let’s review the rationale of split validation.

The partition of a dataset into training data and test data, known as split validation, is a fundamental part of machine learning. The training data is used to detect general patterns and design a prediction model, while the test data is used to road-test the model and compare the results. Thus, if we reserve 30% of the data and test it with the model developed from patterns discovered in the initial 70% of the data, ...

Get Machine Learning: Make Your Own Recommender System now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.