6

Tools for Model Training and Experimenting

In this chapter, we will focus on creating training datasets and building baseline models. We start by building training sets from a training DataFrame and feature tables. You will learn how to combine feature table data with your training data without using traditional joins. We will also return to Databricks AutoML and explore how to use it to establish a baseline model quickly. We then cover how to experiment with different features, hyperparameters, and models when searching for predictive signals in your training data. Manually tracking configurations and their corresponding evaluation metrics is time-consuming. We introduce a component of MLflow called MLflow Tracking, which significantly improves ...

Get Databricks ML in Action now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.