Overview
Use FastAPI to expose an HTTP API for fast live predictions using an ONNX Machine Learning Model. FastAPI is a Python web framework that provides easy development of
documented HTTP APIs by offering self-documented endpoints with Swagger - a tool to describe, document, and use RESTful web services.
Learn how to quickly put together an API which validates requests, and self-documents its endpoints using OpenAPI via Swagger. Quickly produce a robust interface for others to consume
your Machine Learning model by following core best-practices of MLOps.
Parts of this video cover the basics of packaging Machine Learning models, as covered in the Practical MLOps book.
Topics include:
* Create a Python project to serve live predictions using FastAPI
* Use a Dockerfile to package the model and the API using Docker containerization
* With minimal Python code, expose an ONNX model to perform sentiment analysis over an HTTP endpoint
* Dynamically interact with the API using the self-documented endpoint in the container.
Useful links:
* Demo Github Repository with sample code
* Practical MLOps book
* FastAPI Intro tutorial
* RoBERTa ONNX Model for sentiment analysis
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Watch now
Unlock full access