Skip to Content
Retrieval-Augmented Generation in Production with Haystack
book

Retrieval-Augmented Generation in Production with Haystack

by Skanda Vivek
April 2025
Intermediate to advanced content levelIntermediate to advanced
132 pages
3h 1m
English
O'Reilly Media, Inc.

Overview

In today's rapidly changing AI technology environment, software engineers often struggle to build real-world applications with large language models (LLM). The benefits of incorporating open source LLMs into existing workflows is often offset by the need to create custom components. That's where Haystack comes in. This open source framework is a collection of the most useful tools, integrations, and infrastructure building blocks to help you design and build scalable, API-driven LLM backends.

With Haystack, it's easy to build extractive or generative QA, Google-like semantic search to query large-scale textual data, or a reliable and secure ChatGPT-like experience on top of technical documentation. This guide serves as a collection of useful retrieval-augmented generation (RAG) mental models and offers ML engineers, AI engineers, and backend engineers a practical blueprint for the LLM software development lifecycle.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Building AI Agents with LangGraph: Creating Agentic Applications with Large Language Models and LangGraph

Building AI Agents with LangGraph: Creating Agentic Applications with Large Language Models and LangGraph

Sajal Sharma
LLMs in Production

LLMs in Production

Christopher Brousseau, Matthew Sharp
Building LLMs for Production

Building LLMs for Production

Louis-Francois Bouchard, Louie Peters
Developing Apps with GPT-4 and ChatGPT

Developing Apps with GPT-4 and ChatGPT

Olivier Caelen, Marie-Alice Blete

Publisher Resources

ISBN: 9781098165161Errata PageSupplemental Content