Chapter 3. RAG Part II: Chatting with Your Data
In the previous chapter, you learned how to process your data, create embeddings, and store them in a vector store. In this chapter, you’ll learn how to efficiently retrieve the document chunks most relevant to a user’s query. This lets you construct a prompt that includes relevant documents as context, improving the accuracy of the LLM’s final output.
This process—which involves embedding a user’s query, retrieving similar documents from a data source, and then passing them as context to the prompt sent to the LLM—is formally known as retrieval-augmented generation (RAG).
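To make the flow concrete, here is a minimal sketch of those three steps in Python. It assumes the langchain-openai and langchain-core packages and an OpenAI API key are available; the in-memory vector store, the sample texts, and the gpt-4o-mini model are illustrative stand-ins for whatever store and model you set up in the previous chapter.

```python
# Minimal RAG sketch: embed the query, retrieve similar chunks,
# and pass them as context in the prompt sent to the LLM.
from langchain_core.vectorstores import InMemoryVectorStore
from langchain_openai import OpenAIEmbeddings, ChatOpenAI

embeddings = OpenAIEmbeddings()

# Stand-in for the vector store you populated in the previous chapter.
vector_store = InMemoryVectorStore(embedding=embeddings)
vector_store.add_texts([
    "Our return policy allows refunds within 30 days of purchase.",
    "Shipping typically takes 3-5 business days within the US.",
])

query = "How long do I have to return an item?"

# Retrieval: the query is embedded and compared against stored embeddings.
relevant_docs = vector_store.similarity_search(query, k=2)
context = "\n\n".join(doc.page_content for doc in relevant_docs)

# Augmented generation: the retrieved chunks are injected into the prompt.
prompt = (
    "Answer the question using only the context below.\n\n"
    f"Context:\n{context}\n\n"
    f"Question: {query}"
)
llm = ChatOpenAI(model="gpt-4o-mini")
answer = llm.invoke(prompt)
print(answer.content)
```

The rest of this chapter builds on this basic pattern, swapping in different retrieval strategies and data sources while the overall embed, retrieve, and generate loop stays the same.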
RAG is an essential component of building chat-enabled LLM apps that are accurate, efficient, and up-to-date. In this chapter, you’ll progress from basic to advanced strategies for building an effective RAG system across various data sources (such as vector stores and databases) and data types (structured and unstructured).
But first, let’s define RAG and discuss its benefits.
Introducing Retrieval-Augmented Generation
RAG is a technique used to enhance the accuracy of outputs generated by LLMs by providing context from external sources. The term was originally coined in a paper by Meta AI researchers who discovered that RAG-enabled models are more factual and specific than non-RAG models.1
Without RAG, an LLM relies solely on its pretraining data, which may be outdated. For example, let’s ask ChatGPT a question about a current event ...