8 Simplifying Deep Learning Model Deployment

The deep learning (DL) models that are deployed in production environments are often different from the models that are fresh out of the training process. They are usually augmented to handle incoming requests with the highest performance. However, the target environments are often too broad, so a lot of customization is necessary to cover vastly different deployment settings. To overcome this difficulty, you can make use of open neural network exchange (ONNX), a standard file format for ML models. In this chapter, we will introduce how you can utilize ONNX to convert DL models between DL frameworks and how it separates the model development process from deployment.

In this chapter, we’re going to ...

Get Production-Ready Applied Deep Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Production-Ready Applied Deep Learning by Tomasz Palczewski, Jaejun Lee, Lenin Mookiah

8

Simplifying Deep Learning Model Deployment

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly