Skip to Content
Generative AI on AWS
book

Generative AI on AWS

by Chris Fregly, Antje Barth, Shelbee Eigenbrode
November 2023
Intermediate to advanced
312 pages
8h 15m
English
O'Reilly Media, Inc.
Book available
Content preview from Generative AI on AWS

Chapter 9. Context-Aware Reasoning Applications Using RAG and Agents

In this chapter, you will explore how to bring together everything you’ve learned so far to build context-aware reasoning applications. To do this, you will explore retrieval-augmented generation (RAG) and agents. You will also learn about frameworks called LangChain, ReAct, and PAL, which make RAG and agent workflows much easier to implement and maintain. Both RAG and agents are often key components of a generative AI application.

With RAG, you augment the context of your prompts with relevant information needed to address knowledge limitations of LLMs and improve the relevancy of the model’s generated output. RAG has grown in popularity due to its effectiveness in mitigating challenges such as knowledge cutoffs and hallucinations by incorporating dynamic data sources into the prompt context without needing to continually fine-tune the model as new data arrives into your system.

RAG can be integrated with off-the-shelf foundation models or with fine-tuned and human-aligned models specific to your generative use case and domain.

Note

RAG and fine-tuning can be used together. They are not mutually exclusive.

Next, some general guidance to consider when deciding which techniques should be applied. If access to external data or dynamic data is required, then RAG-based architectures can enable this without continuous fine-tuning, which would become cost prohibitive. Also, RAG-based techniques do not require much ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Kubernetes for the Absolute Beginners - Hands-On

Kubernetes for the Absolute Beginners - Hands-On

KodeKloud
Building AI Agents with LLMs: Harnessing the Power of Generative AI with Autonomous Agents

Building AI Agents with LLMs: Harnessing the Power of Generative AI with Autonomous Agents

Abi Aryan, Shawn “swyx” Wang, Div Garg, Kence Anderson, Yohei Nakajima, Jaya Gupta, Arjun Bansal

Publisher Resources

ISBN: 9781098159214Errata PageSupplemental Content