May 2019
Intermediate to advanced
306 pages
8h 20m
English
Gini Impurity measures the likelihood that a randomly chosen observation is classified incorrectly when it is assigned a label at random according to the distribution of classes in the dataset. Consider a dataset with $n$ classes, where $p_i$ is the fraction of observations in the dataset labeled as class $i$. Gini Impurity can be calculated using the following formula:

$$G = \sum_{i=1}^{n} p_i (1 - p_i) = 1 - \sum_{i=1}^{n} p_i^2 \qquad (4.1)$$
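
The calculation can be sketched in a few lines of Python. This is a minimal illustration, not the book's own code: it assumes the standard form of Gini Impurity (one minus the sum of the squared class fractions), and the function name `gini_impurity` is hypothetical.

```python
from collections import Counter


def gini_impurity(labels):
    """Return the Gini Impurity of a non-empty sequence of class labels.

    Assumes the standard definition: 1 - sum(p_i ** 2), where p_i is the
    fraction of observations that carry class label i.
    """
    total = len(labels)
    if total == 0:
        raise ValueError("labels must not be empty")

    # Count how many observations fall into each class.
    counts = Counter(labels)

    # Convert counts to fractions and apply the formula.
    return 1.0 - sum((count / total) ** 2 for count in counts.values())


# A pure node (one class only) has impurity 0.
pure = gini_impurity(["a", "a", "a"])

# Two equally frequent classes give 1 - (0.5**2 + 0.5**2) = 0.5.
balanced = gini_impurity(["a", "a", "b", "b"])

print(pure, balanced)
```

The impurity is 0 when every observation belongs to one class and grows as the classes become more evenly mixed, which is why lower values indicate a better split in a decision tree.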