book

Building AI Intensive Python Applications

Name: Building AI Intensive Python Applications
ISBN: 9781836207252

by Rachelle Palmer, Ben Perlmutter, Ashwin Gangadhar, Nicholas Larew, Sigfrido Narváez, Thomas Rueckstiess, Henry Weller, Richmond Alake, Shubham Ranjan

September 2024

Intermediate to advanced

298 pages

8h 12m

English

Packt Publishing

Read now

Unlock full access

Preface
Who this book is forWhat this book coversTo get the most out of this bookDownload the example code filesConventions usedGet in touchDownload a free PDF copy of this book
Chapter 1: Getting Started with Generative AI
Technical requirementsDefining the terminologyThe generative AI stackPython and GenAIOpenAI APIMongoDB with Vector SearchImportant features of generative AIWhy use generative AI?The ethics and risks of GenAISummary
Chapter 2: Building Blocks of Intelligent Applications
Technical requirementsDefining intelligent applicationsThe building blocks of intelligent applicationsLLMs – reasoning engines for intelligent appsUse cases for LLM reasoning enginesDiverse capabilities of LLMsMulti-modal language modelsA paradigm shift in AI developmentEmbedding models and vector databases – semantic long-term memoryEmbedding modelsVector databasesModel hostingYour (soon-to-be) intelligent appSample application – RAG chatbotImplications of intelligent applications for software engineeringSummary
Part 1: Foundations of AI: LLMs, Embedding Models, Vector Databases, and Application Design
Chapter 3: Large Language Models
Technical requirementsProbabilistic frameworkn-gram language modelsMachine learning for language modellingArtificial neural networksTraining an artificial neural networkANNs for natural language processingTokenizationEmbeddingPredicting probability distributionsDealing with sequential dataRecurrent neural networksTransformer architectureLLMs in practiceThe evolving field of LLMsPrompting, fine-tuning, and RAGSummary
Chapter 4: Embedding Models
Technical requirementsWhat is an embedding model?How do embedding models differ from LLMs?When to use embedding models versus LLMsTypes of embedding modelsChoosing embedding modelsTask requirementsDataset characteristicsComputational resourcesVector representationsEmbedding model leaderboardsEmbedding models overviewDo you always need an embedding model?Executing code from LangChainBest practicesSummary
Chapter 5: Vector Databases
Technical requirementsWhat is a vector embedding?Vector similarityExact versus approximate searchMeasuring searchGraph connectivityNavigable small worldsHow to search a navigable small worldHierarchical navigable small worldsThe need for vector databasesHow vector search enhances AI modelsCase studies and real-world applicationsOkta – natural language access request (semantic search)One AI – language-based AI (RAG over business data)Novo Nordisk – automatic clinical study generation (advanced RAG/RPA)Vector search best practicesData modelingDeploymentSummary
Chapter 6: AI/ML Application Design
Technical requirementsData modelingEnriching data with embeddingsConsidering search use casesData storageDetermining the type of database clusterDetermining IOPSDetermining RAMFinal cluster configurationPerformance and availability versus costData flowHandling static data sourcesStoring operational data enriched with vector embeddingsFreshness and retentionReal-time updatesData lifecycleAdopting new embedding modelsSecurity and RBACBest practices for AI/ML application designSummary
Part 2: Building Your Python Application: Frameworks, Libraries, APIs, and Vector Search
Chapter 7: Useful Frameworks, Libraries, and APIs
Technical requirementsPython for AI/MLAI/ML frameworksLangChainLangChain semantic search with scoreSemantic search with pre-filteringImplementing a basic RAG solution with LangChainLangChain prompt templates and chainsKey Python librariespandasPyMongoArrowPyTorchAI/ML APIsOpenAI APIHugging FaceSummary

Chapter 8: Implementing Vector Search in AI Applications
Technical requirementsInformation retrieval with MongoDB Atlas Vector SearchVector search tutorial in PythonVector Search tutorial with LangChainBuilding RAG architecture systemsChunking or document-splitting strategiesSimple RAGAdvanced RAGSummary
Part 3: Optimizing AI Applications: Scaling, Fine-Tuning, Troubleshooting, Monitoring, and Analytics
Chapter 9: LLM Output Evaluation
Technical requirementsWhat is LLM evaluation?Component and end-to-end evaluationsModel benchmarkingEvaluation datasetsDefining a baselineUser feedbackSynthetic dataEvaluation metricsAssertion-based metricsStatistical metricsLLM-as-a-judge evaluationsRAG metricsHuman reviewEvaluations as guardrailsSummary
Chapter 10: Refining the Semantic Data Model to Improve Accuracy
Technical requirementsEmbeddingsExperimenting with different embedding modelsFine-tuning embedding modelsEmbedding metadataFormatting metadataIncluding static metadataExtracting metadata programmaticallyGenerating metadata with LLMsIncluding metadata with query embedding and ingested content embeddingsOptimizing retrieval-augmented generationQuery mutationExtracting query metadata for pre-filteringFormatting ingested dataAdvanced retrieval systemsSummary
Chapter 11: Common Failures of Generative AI
Technical requirementsHallucinationsCauses of hallucinationsImplications of hallucinationsSycophancyCauses of sycophancyImplications of sycophancyData leakageCauses of data leakageImplications of data leakageCostTypes of costsTokensPerformance issues in generative AI applicationsComputational loadModel serving strategiesHigh I/O operationsSummary
Chapter 12: Correcting and Optimizing Your Generative AI Application
Technical requirementsBaseliningTraining and evaluation datasetsFew-shot promptingRetrieval and rerankingLate interaction strategiesQuery rewritingTesting and red teamingTestingRed teamingInformation post-processingOther remediesSummary
Appendix: Further Reading: Index
Why subscribe?
Other Books You May EnjoyPackt is searching for authors like youDownload a free PDF copy of this book

Content preview from Building AI Intensive Python Applications

9 LLM Output Evaluation

Regardless of the form factor of your intelligent application, you must evaluate your use of large language models (LLMs). The evaluation of a computational system determines the system’s performance, gauges its reliability, and analyzes its security and privacy.

AI systems are non-deterministic. You cannot be certain what an AI system will output until you run an input through it. This means that you must evaluate how the AI system performs on a variety of inputs to have confidence that it performs in line with your requirements. To be able to change the AI system without introducing any unexpected regressions, you also need to have robust evaluations. Evaluations can help catch these regressions before releasing the ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781836207252

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Building AI Intensive Python Applications

by Rachelle Palmer, Ben Perlmutter, Ashwin Gangadhar, Nicholas Larew, Sigfrido Narváez, Thomas Rueckstiess, Henry Weller, Richmond Alake, Shubham Ranjan

9

LLM Output Evaluation

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.