Chapter 5. Fine-Tuning Generative AI Models in Azure
In this chapter, we will discuss when fine-tuning GenAI models is necessary and walk through the different fine-tuning approaches available in Azure, helping you determine the best strategy for optimizing model performance.
Fine-tuning is a powerful method for adapting large language models (LLMs) to specialized tasks and domains by updating their internal parameters with new data. Alongside prompt engineering and retrieval-augmented generation (RAG), fine-tuning offers a way to embed knowledge directly into the model, allowing it to internalize patterns, terminology, or behaviors that are not well represented in its original training corpus. Unlike prompting, which relies on instructive input without altering the model, or RAG, which dynamically incorporates external context at inference time, fine-tuning permanently refines the model’s behavior. This approach enables more precise control over outputs, improved performance on domain-specific tasks, and reduced dependence on lengthy prompts or external retrieval mechanisms. In this chapter, we will explore which conditions make fine-tuning the right choice.
The fine-tuning process begins by continuing the training of a pretrained model on a new, typically smaller dataset that reflects the desired application domain or task. This is achieved by adjusting the model’s parameters to better capture the linguistic patterns, knowledge, or behaviors relevant to the target use case. There are two ...
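Before any training run, that smaller, domain-specific dataset has to be put into a form the service can consume. For chat-model fine-tuning in Azure OpenAI, training data is supplied as a JSONL file in which each line is a complete example conversation. The sketch below is a minimal illustration of assembling and validating such a file with the standard library; the filename and the insurance-support conversations are hypothetical placeholders, not content from this chapter:

```python
import json

# Hypothetical domain-specific example conversations for illustration only.
examples = [
    {"messages": [
        {"role": "system", "content": "You are a support assistant for Contoso insurance products."},
        {"role": "user", "content": "What is a rider on my policy?"},
        {"role": "assistant", "content": "A rider is an optional add-on that modifies the coverage of your base policy."},
    ]},
    {"messages": [
        {"role": "system", "content": "You are a support assistant for Contoso insurance products."},
        {"role": "user", "content": "Is flood damage covered by default?"},
        {"role": "assistant", "content": "No. Flood coverage typically requires a separate rider on a standard policy."},
    ]},
]

# Write one JSON object per line -- the JSONL layout fine-tuning jobs expect.
with open("training_data.jsonl", "w", encoding="utf-8") as f:
    for example in examples:
        f.write(json.dumps(example) + "\n")

# Quick validation pass: every line must parse and contain a "messages" list.
with open("training_data.jsonl", encoding="utf-8") as f:
    rows = [json.loads(line) for line in f]
assert all(isinstance(r["messages"], list) for r in rows)
print(f"Wrote {len(rows)} training examples")
```

In practice the dataset would be far larger than two examples, but keeping every line a self-contained conversation in this format is what allows the service to treat each one as an independent training sample.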