MLOps with Databricks

Name: MLOps with Databricks
Author: Maria Vechtomova
ISBN: 9798341608252

August 2026

Intermediate to advanced

250 pages

7h 27m

English

O'Reilly Media, Inc.

Brief Table of Contents (Not Yet Final)
Preface
Why Databricks; Who this book is for; How this book is organized
1. MLOps principles and components
The rise of MLOps; MLOps components; Code Version control; Continuous Integration and Continuous Delivery/Deployment (CI/CD); Workflow orchestration; Model and Container registry; Model training and Serving; Monitoring; Data version control; Other components to consider; MLOps principles; Documentation; Code quality; Traceability and Reproducibility; Monitoring and Alerting; Databricks MLOps components; Unity catalog; Lakeflow Jobs; Databricks compute; Serverless compute vs classic compute; All-purpose vs job compute; MLflow; Feature engineering; Databricks Serving; Lakehouse Monitoring; Vector search; Databricks Asset Bundles (DABs); Closing
2. Developing on Databricks
Cluster configurations on Databricks; All-purpose and jobs compute configuration; Serverless compute; Project setup; The dataset; Python tooling; pyproject.toml; Project code: data preprocessing; Running code in a Databricks notebook; Databricks development tools; Databricks Command-Line Interface (CLI); Databricks Connect; Databricks Asset Bundles; Visual Studio (VS) Code Extension; Closing
3. MLflow for Traditional ML
Experiment tracking; Tracking URI; MLflow Experiment; MLflow Run; Training data; Hyperparameter tuning and nested runs; Logging and registering models with MLflow; Model logging with sklearn flavour; Logged model; Registering the model; Wrapping the model using pyfunc; MLflow Evaluation; Resource and API limits; Closing
4. Model serving: architectures and implementation
Model serving; Endpoint deployment; Debugging; Feature serving; Creating a feature table; Creating a FeatureSpec; Creating an Online Table; Endpoint deployment; Model serving with feature lookup; Creating feature functions and a feature table; Defining the training set; Register the model and publish an online table; Payload structure; OAuth Authentication; Serving limitations; Closing
5. Machine Learning model deployment
Databricks Asset Bundles; Syntax of the databricks.yml file; The databricks.yml file example; Machine learning pipeline; Data preprocessing; Training, evaluating and registering a model; Deploying model serving endpoint; Serverless vs jobs compute for Lakeflow jobs; Databricks platform design; Promoting code and models to higher environments; Git branching model; Continuous Integration (CI) and Continuous Delivery and Continuous Deployment (CD) pipelines; Branch and deployment protection rules; Promoting models to higher environments; Testing; Closing
6. Monitoring
What to monitor; Recommender Systems; Regression and Demand Forecasting; Large Language Models; Concept Drift vs. Data Drift; Fairness, Bias, and Regulatory Compliance; System and Infrastructure Health; Introduction to Lakehouse monitoring; Profile metrics table; Drift metrics table; Metrics to monitor data drift; From inference tables to Lakehouse monitor; Alerting; Closing
7. LLMOps
Introduction to the chapter; What is an Agent?; Chapter structure; Foundation models on Databricks; Hosting Options; Context engineering; Reference Architecture: The arXiv Curator; From raw data to agent context; MCP servers on Databricks and tool calling; Agent memory with Lakebase; MLflow for GenAI; MLflow flavors; MLflow tracing; Defining a custom agent; LLM system evaluation; Evaluate, log and register the agent; LLM system deployment; Deploying a mosaic AI model serving endpoint; Deployment jobs; Deployment considerations; LLM system monitoring; What to Monitor in LLM Systems; Monitoring implementation; Considerations; Closing
About the Author

Content preview from MLOps with Databricks

Chapter 7. LLMOps

The number of Generative AI (GenAI) use cases has grown rapidly over the past few years, moving well beyond experimentation into applications that deliver tangible business value.

For example, Airbnb completed a large-scale LLM-driven code migration, updating nearly 3,500 React component test files. What was initially estimated to require 1.5 years of manual engineering effort was completed in just six weeks by combining frontier models with robust automation.

Similarly, DoorDash built AutoEval, an LLM-powered, human-in-the-loop system for evaluating the quality of search result pages. Instead of relying on slow and inconsistent human labeling, the system increased evaluation speed by 98% and expanded capacity by a factor of nine. Beyond scalability, it improved alignment with expert judgment and enabled continuous quality monitoring, allowing human experts ...

Publisher Resources

ISBN: 9798341608245
