We now turn to generative methods, starting with boosting. We will first try to classify the datasets using AdaBoost. Because AdaBoost reweights (or resamples) the training instances based on misclassifications, so that each new base learner concentrates on the instances its predecessors got wrong, we expect it to handle our imbalanced dataset relatively well.
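To make the reweighting idea concrete, the following is a minimal sketch of a single round of the discrete AdaBoost weight update. The function name `adaboost_reweight` and the toy data are hypothetical and serve only as an illustration; library implementations may differ in detail.

```python
import numpy as np

def adaboost_reweight(weights, misclassified):
    """One round of the discrete AdaBoost weight update (illustrative only)."""
    # Weighted error of the current base learner.
    error = weights[misclassified].sum()
    # Learner weight: larger when the weighted error is smaller.
    alpha = 0.5 * np.log((1.0 - error) / error)
    # Scale misclassified instances up and correctly classified ones down.
    scaled = weights * np.exp(np.where(misclassified, alpha, -alpha))
    # Normalize so the weights form a distribution again.
    return scaled / scaled.sum(), alpha

# Hypothetical toy case: 10 instances, 2 of them misclassified.
weights = np.full(10, 0.1)
misclassified = np.zeros(10, dtype=bool)
misclassified[:2] = True

new_weights, alpha = adaboost_reweight(weights, misclassified)
print(new_weights)
```

After the update, the misclassified instances together carry half of the total weight, so the next base learner cannot ignore them. This is the mechanism we hope will help with the minority class.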
First, we must decide on the ensemble's size. We generate validation curves for a number of ensemble sizes, depicted as follows:
As we can observe, 70 base learners provide the best trade-off between bias and variance. As such, we will proceed with ensembles ...
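The ensemble-size search described above could be sketched with scikit-learn's `validation_curve`, as shown here. The synthetic data, the candidate sizes, the number of folds, and the F1 scoring metric are all assumptions made for illustration; they are not the dataset or settings behind the curves discussed in the text.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import validation_curve

# Synthetic, imbalanced stand-in data (assumption: not the dataset from the text).
X, y = make_classification(n_samples=600, n_features=10,
                           weights=[0.9, 0.1], random_state=0)

# Candidate ensemble sizes (hypothetical range).
ensemble_sizes = [10, 30, 50, 70]

# Cross-validated train and validation scores for each ensemble size.
train_scores, test_scores = validation_curve(
    AdaBoostClassifier(random_state=0),
    X, y,
    param_name="n_estimators",
    param_range=ensemble_sizes,
    cv=3,
    scoring="f1",  # assumed metric, chosen because the classes are imbalanced
)

# Average over folds and report one row per ensemble size.
for size, tr, te in zip(ensemble_sizes,
                        train_scores.mean(axis=1),
                        test_scores.mean(axis=1)):
    print(f"{size:>3} learners: train F1 = {tr:.3f}, validation F1 = {te:.3f}")
```

A widening gap between the training and validation scores as the ensemble grows suggests increasing variance, while low scores on both suggest bias. The preferred size is the one that balances the two.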