Skip to Content
Big Data Analytics with Java
book

Big Data Analytics with Java

by RAJAT MEHTA
July 2017
Beginner to intermediate
418 pages
9h 46m
English
Packt Publishing
Content preview from Big Data Analytics with Java

Summary

In this chapter, we covered a very important and popular algorithm in machine learning called as decision trees. A decision tree is very similar to a flowchart and is based on a set of rules. A decision tree algorithm learns from a dataset and builds a set of rules. Based on these rules, it splits the dataset into two (in the case of binary splits) or more parts. When a new data is fed in for predictions based on the attributes of the data, a particular path is taken and this follows along the full path of rules in the tree until a particular response is reached.

There are many ways in which we can split data in a decision tree. We explored two of the most common ways called Entropy and Gini Impurity. In either of these cases, the main ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Science with Java

Data Science with Java

Michael R. Brzustowicz
Data Science on AWS

Data Science on AWS

Chris Fregly, Antje Barth
Machine Learning: End-to-End guide for Java developers

Machine Learning: End-to-End guide for Java developers

Richard M. Reese, Jennifer L. Reese, Boštjan Kaluža, Dr. Uday Kamath, Krishna Choppella

Publisher Resources

ISBN: 9781787288980Supplemental Content