Skip to Content
Applied AI for Enterprise Java Development
book

Applied AI for Enterprise Java Development

by Alex Soto Bueno, Markus Eisele, Natale Vinto
November 2025
Intermediate to advanced
430 pages
10h 29m
English
O'Reilly Media, Inc.
Content preview from Applied AI for Enterprise Java Development

Chapter 5. Embedding Vectors, Vector Stores, and Running Models Locally

This chapter introduces three key concepts that make up the foundation of almost all AI-powered applications: embedding vectors, vector stores, and their combination with augmented queries in an architecture called retrieval-augmented generation. We will also tell you more about local model inferencing. We focus on the practical use of local LLMs and how to interact with them via Java-based tools and frameworks. Especially for developers, this is essential to allow effective integration of AI capabilities into applications on their local machines.

You’ll learn how embeddings capture semantic meaning from raw input, how vector stores enable efficient similarity-based retrieval, and how these components integrate with LLMs to power features like semantic search, classification, and long-context memory. The emphasis is on running these capabilities locally for performance, cost, privacy, or offline requirements.

This is a foundational chapter that prepares you for the hands-on implementations in the rest of the book. It builds the necessary understanding of how embeddings and local inference relate to each other, so you can confidently apply them to Java applications in the chapters that follow.

Embedding Vectors and Their Role

Before LLMs can reason about data, they need a way to interpret it. They do this with numbers. This is why we need to talk about embedding vectors. In this section, you’ll learn what ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Java SE 17 Developer (1Z0-829)

Java SE 17 Developer (1Z0-829)

Simon Roberts
AI Codecon: Coding with AI—The End of Software Development as We Know It

AI Codecon: Coding with AI—The End of Software Development as We Know It

Tim O'Reilly, Addy Osmani, Gergely Orosz, Kent Beck, Camille Fournier, Avi Flombaum, Maxi Ferreira, Harper Reed, Jay Parikh, Birgitta Böckeler, Angie Jones, Craig McLuckie, Patty O’Callaghan, Chip Huyen, swyx, Andrew Stellman, Phillip Carter, Nikola Balic, Brett Smith, Chelsea Troy, Lili Jiang, Iyanuoluwa Ajao
Generative AI on AWS

Generative AI on AWS

Chris Fregly, Antje Barth, Shelbee Eigenbrode

Publisher Resources

ISBN: 9781098174491Errata Page