Chapter 10: Scaling Up Your Machine Learning Workflow

In this chapter, you will learn about diverse techniques and patterns to scale your machine learning (ML) workflow in different scalability dimensions. We will look at using a Databricks managed environment to scale your MLflow development capabilities, adding Apache Spark for cases where you have larger datasets. We will explore NVIDIA RAPIDS and graphics processing unit (GPU) support, and the Ray distributed frameworks to accelerate your ML workloads. The format of this chapter is a small proof-of-concept with a defined canonical dataset to demonstrate a technique and toolchain.

Specifically, we will look at the following sections in this chapter:

Developing models with a Databricks Community ...

Get Machine Learning Engineering with MLflow now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Machine Learning Engineering with MLflow by Natu Lauchande

Chapter 10: Scaling Up Your Machine Learning Workflow

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly