4 Regularization with Tree-Based Models

Tree-based models using ensemble learning such as Random Forest or Gradient Boosting are often seen as easy-to-use, state-of-the-art models for regular machine learning tasks.

Many Kaggle competitions have been won with such models, as they can be quite robust and efficient at finding complex patterns in data. Knowing how to regularize and fine-tune them is key to having the very best performance.

In this chapter, we’ll look at the following recipes:

Building a classification tree
Building regression trees
Regularizing a decision tree
Training a Random Forest algorithm
Regularization of Random Forest
Training a boosting model with XGBoost
Regularization with XGBoost

Technical requirements

In this chapter, ...

Get The Regularization Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

The Regularization Cookbook by Vincent Vandenbussche

4

Regularization with Tree-Based Models

Technical requirements

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly