Agentic AI Data Architectures

by Blaize Stewart, Ed Huang

January 2026

Intermediate to advanced

48 pages

1h 6m

English

O'Reilly Media, Inc.

1. What Is Agentic AI and Why Memory Matters
    What Makes AI “Agentic”?
    The Limits of Models
    Diminishing Returns on Model Size
    The Importance of Context
    Memory: Persistence Across Time
    Short-Term Memory
    Long-Term Memory
    Episodic Memory
    Why Traditional Data Architecture Falls Short
    How AI Development Outpaced Data Architecture
    The Misalignment of Stateless Infrastructure and Memory
    Toward Memory as Infrastructure
2. Memory as Infrastructure
    The Fragmented Data Ecosystem
    Structured Data
    Unstructured Data
    Temporal Continuity
    Common Failure Modes with Fragmented Memory Stacks
    The Latency Trap in Agentic AI
    Scaling Limits of Legacy Systems
    Operational Complexity: Orchestration Fragility, Monitoring Blind Spots
    The Case for Distributed SQL
    A Unified Retrieval Foundation
    Declarative Power of SQL for Agentic Retrieval
    Elasticity
    With Agentic Apps, Distributed SQL Wins
    Infrastructure as a Platform for Doing More
3. Beyond Storage: Patterns for Agentic Applications
    Beyond Storage: Semantic and Transactional Patterns
    Semantic-Transactional Join
    Contextual Fact Augmentation
    Probabilistic Joins for Ambiguous Context
    Mixed Workloads: Blending Real-Time Inference with Up-To-Date Context
    Sliding Window Context
    Microbatch Refresh
    Patterns for Retrieval in Agentic Memory
    Retrieval‑Augmented Generation
    Long-Term Memory: Persistent Stores for Agent Continuity
    Episodic Stores: Session-Based, Time-Bound Context Retention
    Temporal Consistency Retrieval
    Multiagent Shared Memory
    Incremental Fact Synchronization
    Bridging into Oversight
4. Operationalizing the AI Memory Layer
    Latency: Enforcing Speed Under Load
    Elasticity and Isolation in Distributed SQL: Why It Matters
    Putting It Together
    Governance: Baked-In, Not Layered on Top
    Standardized Metadata per Request
    Enforced Access Control at the Boundary
    Immutable Audit Logs for Every Fetch
    Accuracy: A Continuous, Systemic Responsibility
    Instrumentation and Metrics at the System Level
    Infrastructure-Supported Feedback Loops
    Applications Consume; Systems Guarantee
About the Authors

Content preview from Agentic AI Data Architectures

Chapter 4. Operationalizing the AI Memory Layer

In modern AI stacks, applications use a variety of retrieval patterns to fetch context dynamically. Those patterns give the illusion of flexibility, but they also hide a requirement that cannot be left to application logic: retrieval must be robust. To support agentic AI workloads at scale, the underlying layer must guarantee latency bounds, enforce governance, and ensure relevance. This chapter shifts focus from “which pattern to choose” to how to operationalize distributed SQL as that reliable substrate. It shows how a distributed SQL database can fulfill the promise behind those retrieval patterns by letting applications simply ask while the system reliably delivers.

Latency: Enforcing Speed Under Load

To the application user, retrieval should feel instantaneous, even under heavy concurrency, but applications cannot reliably enforce this on their own. Instead, the infrastructure should guarantee millisecond-level response times even as demand scales. Techniques such as index optimization, edge caching, and adaptive prefetching become essential infrastructure strategies. Latency comes in many forms, and each imposes distinct challenges that must be addressed at the system level:

Best-case (or average-path) latency: This is the time a request takes under favorable conditions, such as when caches hit, resources are idle, and the query is simple. It reflects the “happy path” performance users expect most of the time. Ensuring that this stays ...
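
One way to see how far the “happy path” sits from the rest of the distribution is to time repeated calls and summarize the samples. The sketch below is illustrative only: `time_calls`, `summarize`, and `fake_retrieval` are hypothetical names, and reporting the median and 99th percentile alongside the best case is an assumption made here for comparison, not a method taken from the book.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_calls(fn: Callable[[], object], runs: int) -> List[float]:
    """Return the wall-clock duration of each call to fn, in milliseconds."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1000.0)
    return samples


def summarize(samples_ms: List[float]) -> Dict[str, float]:
    """Summarize latency samples: best case, median, and 99th percentile."""
    if len(samples_ms) < 2:
        raise ValueError("need at least two samples")
    # quantiles(n=100) returns 99 cut points; index 98 is the 99th percentile.
    cuts = statistics.quantiles(samples_ms, n=100, method="inclusive")
    return {
        "best_ms": min(samples_ms),
        "median_ms": statistics.median(samples_ms),
        "p99_ms": cuts[98],
    }


def fake_retrieval() -> int:
    """Hypothetical stand-in for a real retrieval call (assumption)."""
    return sum(range(1000))


if __name__ == "__main__":
    print(summarize(time_calls(fake_retrieval, 200)))
```

In practice the stand-in would be replaced by the actual query path, measured under the concurrency the system is expected to serve, since an idle-system measurement reflects only the favorable conditions described above.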

Publisher Resources

ISBN: 0642572250140
