Skip to Content
Python: Advanced Predictive Analytics
book

Python: Advanced Predictive Analytics

by Ashish Kumar, Joseph Babcock
December 2017
Beginner to intermediate
660 pages
15h 31m
English
Packt Publishing
Content preview from Python: Advanced Predictive Analytics

Understanding the mathematics behind decision trees

The main goal in a decision tree algorithm is to identify a variable and classification on which one can give a more homogeneous distribution with reference to the target variable. The homogeneous distribution means that similar values of the target variable are grouped together so that a concrete decision can be made.

Homogeneity

In the preceding example, the first goal would be to find a parameter (out of four: Terrain, Rainfall, Groundwater, and Fertilizers) that results in a better homogeneous distribution of the target variable within those categories.

Without any parameter, the count of harvest type looks as follows:

Bumper

Moderate

Meagre

4

9

7

Let us calculate, for each parameter, ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Mastering Predictive Analytics with Python

Mastering Predictive Analytics with Python

Joseph Babcock

Publisher Resources

ISBN: 9781788992367Supplemental Content