December 2018
Beginner to intermediate
684 pages
21h 9m
English
The two key drivers of gradient boosting performance are the size of the ensemble and the complexity of its constituent decision trees.
The control of complexity for decision trees aims to avoid learning highly specific rules that typically imply a very small number of samples in leaf nodes. We covered the most effective constraints used to limit the ability of a decision tree to overfit to the training data in the previous chapter. They include requiring:
In addition to directly controlling the size ...