Chapter 3. Advanced Planning, Reasoning, and Scalable Execution in Agents
In the last two chapters, you saw that AI agents aren't magic; they're engineered systems. But the techniques you'll learn in this chapter may start to feel close to magic. As Arthur C. Clarke once said:
Any sufficiently advanced technology is indistinguishable from magic. 1
That sense of magic comes from what happens when agents stop merely reacting and instead start learning from their own experience or making smarter choices at test time. Once you apply the principles of reinforcement learning (RL) to LLMs, agents are no longer non-player characters (NPCs): they learn and adapt on their own. This is how Clarke's quote finds new meaning in the world of AI agents.
But what may look uncanny at first actually comes from a set of clear mechanisms. This chapter explains those mechanisms in depth: you'll learn how RL builds a feedback loop between reasoning and outcomes, how tree-based search and adaptive planning let agents ...