Chapter 4: Tracking Code and Data Versioning

DL models are not just models – they are intimately tied to the code that trains and tests the model and the data that's used for training and testing. If we don't track the code and data that's used for the model, it is impossible to reproduce the model or improve it. Furthermore, there have been recent industry-wide awakenings and paradigm shifts toward a data-centric AI (https://www.forbes.com/sites/gilpress/2021/06/16/andrew-ng-launches-a-campaign-for-data-centric-ai/?sh=5cbacdc574f5), where the importance of data is being lifted to a first-class artifact in building ML and, especially, DL models. Due to this, in this chapter, we will learn how to track code and data versioning using MLflow. We ...

Get Practical Deep Learning at Scale with MLflow now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Practical Deep Learning at Scale with MLflow by Yong Liu, Dr. Matei Zaharia

Chapter 4: Tracking Code and Data Versioning

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly