9

Evaluating RAG Quantitatively and with Visualizations

Evaluation plays a crucial role in building and maintaining retrieval-augmented generation (RAG) pipelines. While you build the pipeline, you can use evaluation to identify areas for improvement, optimize the system’s performance, and systematically measure the impact of improvements. When your RAG system is deployed, evaluation can help ensure the effectiveness, reliability, and performance of the system.

In this chapter, we will cover the following topics:

  • Evaluating when building a RAG application
  • Evaluating a RAG application after deployment
  • Standardized evaluation frameworks
  • Ground truth
  • Code lab 9.1 – ragas
  • Additional evaluation techniques for RAG systems

Let’s start by talking ...

Get Unlocking Data with Generative AI and RAG now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.