June 2020
Intermediate to advanced
382 pages
11h 39m
English
The distinguishing feature of decision tree classification is the generation of the human-interpretable hierarchy of rules that are used to predict the label at runtime. The algorithm is recursive in nature. Creating this hierarchy of rules involves the following steps:
Find the most important feature: Out of all of the features, the algorithm identifies the feature that best differentiates between the data points in the training dataset with respect to the label. The calculation is based on metrics such as information gain or Gini impurity.
Bifurcate: Using the most identified important feature, the algorithm creates a criterion that is used to divide the training dataset into two ...