Table of Contents
Preface
Part 1 – Introduction to Retrieval-Augmented Generation (RAG)
1
What Is Retrieval-Augmented Generation (RAG)
Understanding RAG – Basics and principles
Advantages of RAG
Challenges of RAG
RAG vocabulary
LLM
Prompting, prompt design, and prompt engineering
LangChain and LlamaIndex
Inference
Context window
Fine-tuning – full-model fine-tuning (FMFT) and parameter-efficient fine-tuning (PEFT)
Vector store or vector database?
Vectors, vectors, vectors!
Vectors
Implementing RAG in AI applications
Comparing RAG with conventional generative AI
Comparing RAG with model fine-tuning
The architecture of RAG systems
Summary
2
Code Lab – An Entire RAG Pipeline
Technical requirements
No interface!
Setting up a large language model ...
Get Unlocking Data with Generative AI and RAG now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.