Skip to Content
Practical Data Science with Python
book

Practical Data Science with Python

by Nathan George
September 2021
Beginner to intermediate
620 pages
15h 30m
English
Packt Publishing
Content preview from Practical Data Science with Python

15

Tree-Based Machine Learning Models

We've seen a few of the simpler machine learning models, and now it's time to examine some more advanced models. In this chapter, we will look at the family of machine learning models that is based on decision trees. These models, especially the boosted models, have won machine learning contests and are used in industry for state-of-the-art ML performance. Here, we'll cover:

  • How decision trees work in machine learning
  • Random forests in sklearn and H2O, which are collections of decision trees
  • Feature importances from tree-based methods
  • Boosted algorithms, including AdaBoost, XGBoost, LightGBM, and CatBoost

Let's start with the basic decision tree and how it works.

Decision trees

Decision trees are simple ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Science Projects with Python - Second Edition

Data Science Projects with Python - Second Edition

Stephen Klosterman
Python: End-to-end Data Analysis

Python: End-to-end Data Analysis

Phuong Vothihong, Martin Czygan, Ivan Idris, Magnus Vilhelm Persson, Luiz Felipe Martins

Publisher Resources

ISBN: 9781801071970Supplemental Content