Skip to Content
Scaling Machine Learning with Spark
book

Scaling Machine Learning with Spark

by Adi Polak
March 2023
Intermediate to advanced
291 pages
8h 54m
English
O'Reilly Media, Inc.
Content preview from Scaling Machine Learning with Spark

Chapter 10. Deployment Patterns for Machine Learning Models

Throughout this book, we’ve been discussing the machine learning lifecycle. As a quick reminder, at a high level, the lifecycle of a machine learning system is similar to the software development lifecycle. This means it includes multiple stages, which we can summarize as follows:

Development
Training the model
Validation
Validating the model
Staging
Testing the model in a production-like environment
Deployment
Putting the machine learning system into production
Archiving
Retiring the model and, if necessary, replacing it with a new version

In the previous chapters, we’ve covered the first few stages of the lifecycle in depth, including various tools and methods for distributed training. In this final chapter, I will provide guidance on how to think through the deployment process and what considerations you should be aware of. Deployment takes place once you have a model that produces accurate results that you are content with and you’re ready to serve it and put it into production. If this is not the case, it’s best to continue exploring with additional algorithms and parameters, and perhaps fresh data.

When thinking about deploying a model, we need to define when and where it will be used in the overall production system workflow. It may be part of a bigger data flow, or it may be a standalone application exposing APIs for users to interact with. The model can also be wrapped and served as a UDF as part of a Spark ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Deep Learning with PyTorch

Deep Learning with PyTorch

Eli Stevens, Luca Pietro Giovanni Antiga, Thomas Viehmann
Machine Learning for High-Risk Applications

Machine Learning for High-Risk Applications

Patrick Hall, James Curtis, Parul Pandey

Publisher Resources

ISBN: 9781098106812Errata Page