15

Tree-Based Machine Learning Models

We've seen a few of the simpler machine learning models, and now it's time to examine some more advanced models. In this chapter, we will look at the family of machine learning models that is based on decision trees. These models, especially the boosted models, have won machine learning contests and are used in industry for state-of-the-art ML performance. Here, we'll cover:

  • How decision trees work in machine learning
  • Random forests in sklearn and H2O, which are collections of decision trees
  • Feature importances from tree-based methods
  • Boosted algorithms, including AdaBoost, XGBoost, LightGBM, and CatBoost

Let's start with the basic decision tree and how it works.

Decision trees

Decision trees are simple ...

Get Practical Data Science with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.