Appendix A. AI Performance Metrics
This section collects the relevant quantitative performance metrics, including those for generative AI (GenAI) on Microsoft Azure at both the model and system levels.
AI Model Performance
These metrics relate directly to AI models (see Table A-1), covering classification and regression tasks for machine learning and deep learning, as well as metrics specific to traditional and modern language models. They serve as the baseline for evaluating quantitative performance during both preliminary testing and postproduction maintenance.
Table A-1. AI model performance metrics

| AI model type | Metric | Range of values | Purpose |
|---|---|---|---|
| Classification | AUC-ROC (area under the ROC curve) | 0 to 1 (higher is better) | Measures how well a model distinguishes between classes |
| | Precision | 0 to 1 (higher is better) | Measures the proportion of correctly identified positive results out of total predicted positives |
| | Recall | 0 to 1 (higher is better) | Measures the proportion of actual positives correctly identified |
| | F1 score (based on precision and recall) | 0 to 1 (higher is better) | Balances precision and recall for imbalanced datasets |
| | F2 score (based on precision and recall) | 0 to 1 (higher is better) | A weighted average of precision and recall that gives more weight to recall, favoring the capture of true positives |
| Regression | MAE (mean absolute error) | 0 to ∞ (lower is better) | Measures the average absolute error between predicted and actual values ... |
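To make the relationships between these metrics concrete, the following is a minimal sketch that computes precision, recall, F1, F2, and MAE by hand on hypothetical predictions. The labels and values are invented for illustration and are not tied to any Azure service or model.

```python
# Toy illustration of several metrics from Table A-1, computed from first
# principles on hypothetical data (no external libraries required).

def precision_recall(y_true, y_pred):
    """Return (precision, recall) for binary labels in {0, 1}."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall

def f_beta(precision, recall, beta):
    """F-beta score: beta=1 gives F1; beta=2 gives F2, weighting recall more."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)

def mae(y_true, y_pred):
    """Mean absolute error between predicted and actual values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical binary classification labels and predictions
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]
p, r = precision_recall(y_true, y_pred)
print(f"precision={p:.2f} recall={r:.2f} "
      f"F1={f_beta(p, r, 1):.2f} F2={f_beta(p, r, 2):.2f}")

# Hypothetical regression targets and predictions
print(f"MAE={mae([3.0, 5.0, 2.5], [2.5, 5.0, 3.0]):.2f}")
```

In practice these values would come from a library such as scikit-learn rather than hand-rolled functions; the point here is only to show how each metric in the table is derived from the same set of predictions.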