Skip to Content
Generative AI in Action
book

Generative AI in Action

by Amit Bahree
November 2024
Intermediate to advanced
464 pages
14h 38m
English
Manning Publications
Content preview from Generative AI in Action

11 Scaling up: Best practices for production deployment

This chapter covers

  • Challenges and deployment options to consider for an application ready for production
  • Production best practices covering scalability, latency, caching, and managed identities
  • Observability of LLM applications, with some practical examples
  • LLMOps and how it compliments MLOps

When organizations are ready to take their generative AI models from the realm of proof of concept (PoC) to the real world of production, they embark on a journey that requires careful consideration of key aspects. This chapter will discuss deployment and scaling options, sharing best practices for making generative AI solutions operational, reliable, performant, and secure.

Deploying and ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Generative AI with LangChain

Generative AI with LangChain

Ben Auffarth
Introduction to Generative AI

Introduction to Generative AI

Numa Dhamani, Maggie Engler

Publisher Resources

ISBN: 9781633436947Supplemental ContentPublisher SupportOtherPublisher WebsiteSupplemental ContentPurchase Link