July 2017
Beginner to intermediate
378 pages
10h 26m
English
The depth or shallowness of a decision tree refers to how many levels of hierarchies it has. The smaller the number of hierarchies, the more shallow the tree. GBM uses successive weak learners, which are trained on a measure of error from the previous model. A weak learner means it has some predictive power but not much. A shallow tree is typically weak, as it is only using one or a small number of features to split the training data.
The resulting predictions are added together to arrive at a final prediction. There are several variants that use different methods of estimating error and determining how shallow to make each tree.
GBMs work by successively dialing in on higher error areas using the error of the previous model ...