Skip to Content
RAG with Python Cookbook
book

RAG with Python Cookbook

by Dominik Polzer
May 2026
Intermediate to advanced
378 pages
8h 17m
English
O'Reilly Media, Inc.
Content preview from RAG with Python Cookbook

Chapter 5. Embeddings

Embedding models convert text, images, and other content into vectors that capture semantic meaning. In a RAG system, these vectors let the retriever search large unstructured collections for content that is relevant to a user’s question. The retriever embeds the query, compares it to vectors stored in a vector database, and ranks candidates by distance. Smaller distances indicate higher semantic similarity, which determines what fits into the LLM’s limited context window.

A typical embedding-based retrieval flow looks like this:

  1. Split documents into chunks and embed them when building the vector database.

  2. Embed each incoming user query with the same model.

  3. Compute distances between the query vector and stored vectors.

  4. Retrieve the closest chunks and pass them to the LLM as context.

This chapter shows how to work with embedding models from providers such as OpenAI, Google, and open source projects. You will generate embeddings, visualize semantic relationships, measure vector distances, and use them in practical RAG pipelines. The recipes also cover model selection, multimodal embeddings, and hybrid retrieval that combines vectors with keyword or metadata filters.

You can find all the code examples for this chapter in the book’s GitHub repository.

5.1 Mapping the Linguistic Meaning of Text Chunks to a Numerical Representation

Problem

You want to map the semantic meaning of words and sentences into a numerical representation.

Solution

Use an ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Python Polars: The Definitive Guide

Python Polars: The Definitive Guide

Jeroen Janssens, Thijs Nieuwdorp

Publisher Resources

ISBN: 9798341600553Errata Page