Binary decisions

Let's consider an input dataset, X:

Every vector is made up of m features, so each of them is a candidate for the creation of a node based on a tuple (feature, threshold):

Single splitting node

According to the feature and the threshold, the structure of the tree will change. Intuitively, we should pick the feature that best separates our data. In other words, a perfectly separating feature will only be present in a node, and the two subsequent branches won't be based on it anymore. These conditions guarantee the convergence ...

Get Machine Learning Algorithms - Second Edition now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.