Chapter 12. Deployment of AI Services
In this final chapter, it is time to complete your GenAI solution by deploying it. You’re going to learn several deployment strategies and, as part of deployment, containerize your services with Docker following its best practices.
Deployment Options
You now have a working GenAI service that you want to make accessible to your users. What are your deployment options? There are a few common deployment strategies you can adopt:
- Virtual machines (VMs)
- Serverless functions
- Managed application platforms
- Containerization
Let’s explore each in more detail.
Deploying to Virtual Machines
If you plan to use your own on-premises servers, or prefer to deploy your services on the same hardware that hosts your other applications while keeping them highly isolated and secure, you can deploy your GenAI service to a VM.
A VM is a software emulation of a physical computer that runs its own operating system (OS) and applications. To the software running inside it, a VM behaves just like a physical computer such as a laptop, smartphone, or server.
The VM’s host system provides resources such as CPU, memory, ...