July 2017
Beginner to intermediate
715 pages
17h 3m
English
Holding out only one part of the data may not always be the best option. What we can do instead is splitting it into K parts and then testing the models only on 1/Kth of the data.
This is called k-fold cross-validation; it not only gives the performance estimation, but also the possible spread of the error. Typically, we are interested in models that give good and consistent performance. K-fold cross-validation helps us to select such models.
Next we prepare the data for k-fold cross-validation as follows:
If we translate this into Java, the first step will look ...
Read now
Unlock full access