Skip to Content
Hands-On RAG for Production
book

Hands-On RAG for Production

by Ofer Mendelevitch, Forrest Sheng Bao
June 2026
Intermediate to advanced
358 pages
10h 52m
English
O'Reilly Media, Inc.
Content preview from Hands-On RAG for Production

Foreword by Sharon Zhou

The first time I saw a RAG system fail in production, it was because someone had naively chunked their documents on fixed character boundaries and split a legal clause in half. Clause A was in one chunk with some of clause B, and the rest of clause B was in another. The problem was that the second chunk provided a useful, common exception. The RAG system retrieved the first chunk but not the second based on the user’s question, so unfortunately, the model answered the user’s question with the opposite of what the contract said. Just think: if you were given incomplete or faulty knowledge through a Google search, you’d also have trouble giving the right answer.

No one building the system had been thinking about chunking strategy, not critically. They had been busy debating about which LLM to use. That’s why a book like this is so important for those building RAG with LLMs and agents in production.

RAG looks deceptively simple:

  • Chunk your documents—easy, that’s a string split.
  • Embed your chunks—easy, that’s a lightweight model API call in a for loop.
  • Retrieve the relevant chunks—easy, that’s just using search, which has been around a lot longer than modern AI, so in a way, it should have best practices baked in already.
  • Hand those chunks to an LLM—easy, that’s just appending strings to another string to form a prompt.

You can build a working prototype by one-shotting a language model. But… you can also spend the next year working through parsing, chunking, ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Building LLMs for Production

Building LLMs for Production

Louis-Francois Bouchard, Louie Peters
LLMs in Production

LLMs in Production

Matthew Sharp, Christopher Brousseau

Publisher Resources

ISBN: 9798341621701Errata Page