Skip to Content
Data Science on AWS
book

Data Science on AWS

by Chris Fregly, Antje Barth
April 2021
Intermediate to advanced
521 pages
13h 33m
English
O'Reilly Media, Inc.
Book available
Content preview from Data Science on AWS

Chapter 10. Pipelines and MLOps

In previous chapters, we demonstrated how to perform each individual step of a typical ML pipeline, including data ingestion, analysis, and feature engineering—as well as model training, tuning, and deploying.

In this chapter, we tie everything together into repeatable and automated pipelines using a complete machine learning operations (MLOps) solution with SageMaker Pipelines. We also discuss various pipeline-orchestration options, including AWS Step Functions, Kubeflow Pipelines, Apache Airflow, MLFlow, and TensorFlow Extended (TFX).

We will then dive deep into automating our SageMaker Pipelines when new code is committed, when new data arrives, or on a fixed schedule. We describe how to rerun a pipeline when we detect statistical changes in our deployed model, such as data drift or model bias. We will also discuss the concept of human-in-the-loop workflows, which can help to improve our model accuracy.

Machine Learning Operations

The complete model development life cycle typically requires close collaboration between the application, data science, and DevOps teams to successfully productionize our models, as shown in Figure 10-1.

Figure 10-1. Productionizing machine learning applications requires collaboration between teams.

Typically, the data scientist delivers the trained model, the DevOps engineer manages the infrastructure that hosts the ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Engineering with AWS

Data Engineering with AWS

Gareth Eagar
Data Engineering with Python and AWS Lambda LiveLessons

Data Engineering with Python and AWS Lambda LiveLessons

Noah Gift, Robert Jordan, Kennedy Behrman

Publisher Resources

ISBN: 9781492079385Errata PageSupplemental Content