July 2017
To define the most commonly used impurity measures, we first need to consider the total number of target classes:
$$y \in \{1, 2, \ldots, n\}$$
For a given node j, we can define the probability p(i|j), where i is an index in [1, n] associated with each class. In other words, following a frequentist approach, this value is the ratio between the number of samples in node j that belong to class i and the total number of samples in that node.
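The frequentist definition of p(i|j) can be illustrated with a minimal sketch. The function name `class_probabilities`, the sample labels, and the assumption that classes are encoded as integers from 1 to n are all hypothetical choices made for this example; they are not taken from the text.

```python
import numpy as np


def class_probabilities(node_labels, n_classes):
    """Estimate p(i|j) for each class i from the labels of the samples in node j.

    Assumes (for illustration only) that labels are integers in [1, n_classes].
    """
    node_labels = np.asarray(node_labels)
    if node_labels.size == 0:
        raise ValueError("a node must contain at least one sample")
    # Shift labels to 0-based so that np.bincount counts class i at index i - 1.
    counts = np.bincount(node_labels - 1, minlength=n_classes)
    # Ratio between per-class counts and the total number of samples in the node.
    return counts / node_labels.size


# Hypothetical node containing 8 samples drawn from 3 classes.
labels_in_node = [1, 1, 2, 3, 3, 3, 1, 2]
probs = class_probabilities(labels_in_node, n_classes=3)
print(probs)
```

Because each entry is a count divided by the node size, the resulting values are non-negative and sum to 1, which is the property the impurity measures rely on.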