July 2018
Beginner to intermediate
376 pages
9h 1m
English
In an ensemble, we have many base models—say L number of them. For the classification problem, we have base models as classifiers. If we have a regression problem, we have the base models as learners. Since the diagnostics are performed on the training dataset only, we will drop the convention of train and valid partitions. For simplicity, during the rest of the discussion, we will assume that we have N observations. The L number of models implies that we have L predictions for each of the N observations, and thus the number of predictions is
. It is in these predictions that we try to find the diversity of the ensemble. The diversity ...