O'Reilly logo

Hands-On Natural Language Processing with Python by Rajalingappaa Shanmugamani, Rajesh Arumugam

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Deploying Trained Models

In this chapter, you will learn how to deploy trained deep learning models to production environments on various platforms, such as cloud and mobile. For cloud deployment, the latency and throughput are important. The latency has to be at a minimum, and the throughput has to be high. The performance largely depends on the model and hardware. There are several optimizations available for CPU and GPU. For mobile platforms, speed, and energy consumption are important.

In this chapter, you will learn techniques to meet your deployment goals through the following topics:

  • Increasing performance by changing models
  • Using the TensorFlow serving tool
  • Deploying to cloud services, such as AWS, GCP, and Azure
  • Deploying to mobile ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required