book

What Is LLMOps?

by Abi Aryan

May 2024

Intermediate to advanced

48 pages

1h 5m

English

O'Reilly Media, Inc.

Book available

Read now

Unlock full access

LLM Applications: Breakthrough or Hype?The Looming Transformation: Five Key Applications of LLMsKnowledge RetrievalTranslationProgrammingAudio-Speech SynthesisRecommender SystemsAutonomous AgentsConclusion
Operationalizing LLMsChallengesThe Goals of LLMOpsSafetyScalabilityRobustnessThe Model Life CycleDevelopmentDeploymentServingConclusion
Step 1: Data EngineeringData CollectionData PreprocessingData StorageData ManagementStep 2: PretrainingTokenizationCheckpointingModel Versioning and LoggingStep 3: Choosing a Base ModelStep 4: Domain AdaptationPrompt EngineeringRetrieval-Augmented GenerationFine-TuningStep 5: Model EvaluationEvaluation ChallengesTools and GuardrailsStep 6: Integration and OrchestrationStep 7: Security and Reliability EngineeringStep 8: Deployment and MonitoringDeployment PatternsModel ServingDeployment StrategyConclusion

Overview

Large language models (LLMs), a subcategory of generative AI, have taken the world by storm. Commonly known for their application in ChatGPT, LLMs have unleashed new energy among developers and businesses looking to integrate AI into their applications. But the internet is also full of disjointed information about LLM applications and how to integrate and deploy them reliably into products and applications.

In this report, Abi Aryan takes you through the process of developing a cohesive framework for efficiently and reliably using LLMs to supercharge your applications. It's ideal for data scientists, machine-learning engineers, data engineers, and software engineers.

You'll examine:

The difference between LLM demos and efficient ML products that require a more robust framework
How LLMs like GPT-4 can be incredibly complex and resource-intensive
Key challenges of operationalizing LLMs, including massive data requirements, model size and complexity, performance monitoring, and security and privacy

About the author:

Abi Aryan is an independent consultant with more than 7 years of experience using and adapting ML research to solve real-world engineering challenges.