Skip to Content
View all events

AI Agents Bootcamp

Published by O'Reilly Media, Inc.

Intermediate content levelIntermediate

Designing and deploying enterprise agentic systems

What you’ll learn and how you can apply it

  • Learn the core mental models for agent design and how they map to frameworks such as LangGraph and CrewAI
  • Use LangGraph to build single-agent and multi-agent systems with typed states, tool use, memory, and conditional decision loops
  • Understand and apply collaboration and communication patterns in multi-agent systems
  • Learn how to transform unstructured, semi structured, and multimodal inputs into agent-ready structured data using schema-based prompting, Pydantic validation, and MCP-driven communication patterns
  • Select appropriate model types and agent roles for planning, retrieval, tool use, and fast intra-agent communication
  • Design memory and context strategies that support both individual agent reasoning and group-level coordination
  • Apply guardrails, secure tool execution environments, and design threat-aware workflows

Course description

In this two-day bootcamp, Nicole Koenigstein shares a clear, four-phase strategy for agent design: build a simple, stateful agent with basic tools; add intelligence with structured reasoning and human-in-the-loop safeguards; scale up to multi-agent systems using supervisor, hierarchical, or swarm setups; and apply production pillars such as optimization, context strategy, model architecture, monitoring, and cybersecurity.

As you go through the phases and perform the exercises, you’ll begin to think like an agent designer, build your own multi-agent system, and acquire a repeatable approach to breaking down new problems into clear, well-structured agent and multi-agent designs.

This live event is for you because...

  • You’re a machine learning engineer, software engineer, or technical architect who wants to design more reliable and interpretable agentic systems.
  • You want to understand how to map complex tasks to agent workflows and how to use frameworks such as LangGraph.
  • You’re building or planning to build multi-agent systems and want a clear understanding of collaboration and communication patterns.
  • You want to improve the stability, performance, and maintainability of your existing agent pipelines.
  • You need practical guidance on model selection, memory design, and workflow coordination for real-world applications.
  • You’re preparing to deploy agents in production and want to ensure they behave predictably and are easy to debug, monitor, and test.

Prerequisites

  • A Python 3.12 environment set up on your computer (best in Google Colab)
  • Dependencies from GitHub repository installed (link to come)
  • An API key for OpenAI and for Novita AI (https://novita.ai/) or Nebius (https://nebius.com/) to use open source models
  • A basic understanding of how LLMs work at a conceptual level
  • Experience interacting with LLMs through an API and working with model parameters, prompts, and responses
  • Intermediate Python programming skills, including writing functions, using classes, and working with common libraries
  • Basic familiarity with JSON and common developer tools like Docker or yaml

Recommended preparation:

Recommended follow-up:

Schedule

The time frames are only estimates and may vary according to how the class is progressing.

DAY 1: Foundations of Agent Design and Multi-Agent Collaboration

Foundations of agent design and task decomposition (60 minutes)

  • Presentation: State machines, structured state transitions, and how to translate real-world tasks into agent workflows using LangGraph
  • Demo: Transforming a stateless LLM into a stateful agent with basic tool use
  • Hands-on exercise: Decompose a given small task and implement it as a LangGraph workflow with state and one tool
  • Q&A
  • Break

Making the agent smarter (60 minutes)

  • Presentation: Adding intelligence and optimization at test time using structured reasoning patterns such as chain of thought and ReAct
  • Demo: Implementing chain of thought and ReAct for an agent from scratch
  • Hands-on exercise: Compare a simple single agent to a planner; judge setup and observe how structured reasoning and test-time compute affect the quality of the outcome
  • Q&A

Adding human oversight to your system (60 minutes)

  • Presentation: How much autonomy agents have in practice; implementing safeguards with human-in-the-loop patterns
  • Demo: Adding human-in-the-loop checkpoints in LangGraph to approve or reject actions, edit graph state, and request additional human input before sensitive steps
  • Hands-on exercise: Insert a human-in-the-loop gate into your own agent workflow
  • Q&A
  • Break

From single agents to multi-agent systems (60 minutes)

  • Presentation: Scaling from a single, tool-using agent to a multi-agent system using collaboration patterns; when to apply each
  • Demo: Building a supervisor-based multi-agent system in LangGraph where a coordinator agent delegates tasks to specialist agents
  • Hands-on exercise: Turn your existing agent into a two-agent workflow and test how responsibilities and handoffs change system behavior
  • Q&A
  • Break

Communication patterns in multi-agent systems (60 minutes)

  • Presentation: How multi-agent systems coordinate through message passing, shared memory access patterns, and event-driven communication; when to choose each method; how communication shapes collaboration patterns, latency, handoff reliability, and system throughput
  • Demo: How different multi-agent collaboration patterns work together with communication patterns; which ones to combine for which use case
  • Hands-on exercise: Exchange a communication pattern in a predefined MAS and observe how it affects agent coordination and communication
  • Q&A

DAY 2: Production-Ready Multi-Agent Systems

From unstructured and multimodal inputs to structured agent-ready data (60 minutes)

  • Presentation: Optimizing agents for structured, unstructured, and multimodal data; aligning their outputs through schema-based prompting with Pydantic validation; enabling stable multi-agent communication using the Model Context Protocol (MCP)
  • Demo: Transforming unstructured data retrieved from an MCP server into agent-digestible structured data
  • Hands-on exercise: Take a free-form task description and define a minimal schema, configuration, and prompt so that an agent can consistently turn it into structured data for a downstream tool or coding workflow
  • Q&A
  • Break

Memory design and shared context strategies (60 minutes)

  • Presentation: How episodic memory preserves task-level context; how procedural memory enables routines that can be reused across tasks; capturing higher-level reasoning patterns through long-term memory structures; determining what information should be stored, when memory should be updated, and how retrieval shapes subsequent decisions; checkpointing as a mechanism for preserving state
  • Demo: A LangGraph workflow that maintains episodic and procedural memory and uses checkpoints; how memory is read and written during execution; how time travel allows stepping back to an earlier state to inspect the reasoning path
  • Hands-on exercise: Extend your agent by adding a simple episodic memory that stores each tool result and a procedural memory entry that records a small reusable routine
  • Q&A
  • Break

Choosing the right models and architectural patterns for agent reasoning (60 minutes)

  • Presentation: How architectural decisions shape the capability, speed, and stability of an agentic system; comparing planner executor setups, supervisor based coordination, and unified agent designs; assigning models to roles based on their strengths; when to use thinking or no thinking inference modes; KV cache behavior; context engineering, and the need for stable prefixes when designing long running workflows
  • Demo: Comparing two LangGraph architectures, one using a planner with a thinking mode enabled and one using a unified agent with no thinking mode
  • Hands-on exercise: Modify a predefined workflow by switching the planning component between thinking and no thinking modes and observe the change in latency and decision quality; run the same workflow with a dense model and a smaller MoE model to compare behavior
  • Q&A
  • Break

Observability, monitoring, and benchmarking (60 minutes)

  • Presentation: Observing, measuring, and evaluating agent behavior in production; building task-specific benchmarks for multi-agent systems
  • Demo: Monitoring and evaluating an agent workflow with monitoring hooks, logging, and a custom evaluation loop
  • Hands-on exercise: Add tracking to your workflow with one simple logging function that prints each state transition; run the workflow twice with small changes in the input and compare the differences in the logged steps to evaluate stability
  • Q&A
  • Break

Security, guardrails, and secure execution environments (50 minutes)

  • Presentation: The agentic threat landscape (reasoning manipulation, memory poisoning, tool misuse, communication corruption), safeguarding coding agents
  • Demo: Securing an agentic workflow using OmegaConf-driven security policies, Pydantic schema validation, MCP tool governance, and PRP-based coding constraints
  • Group discussion: Review a small predefined scenario involving an unsafe or ambiguous agent action and discuss where the vulnerabilities lie, what should be validated, and what mitigation strategy is most appropriate

Wrap-up and Q&A (10 minutes)

Your Instructor

  • Nicole Koenigstein

    Nicole Koenigstein is an AI researcher and practitioner in agentic systems, working across research, consulting, teaching, and direct system implementation to build reliable, production-ready AI systems. Her work focuses on multi-agent architectures, evaluation, safety, and long-term system behavior. She’s the author of Math for Machine Learning and Transformers in Action (Manning Publications). Her forthcoming books, Transformers: The Definitive Guide and Applications Beyond NLP and AI Agents: The Definitive Guide, will be published by O’Reilly Media. Nicole served as an external evaluator for a European Commission AI Grand Challenge and has advised IOSCO on generative AI in regulated environments. She also serves on advisory boards for leading AI and quantitative finance conferences and regularly delivers invited talks and technical workshops across academia, industry, and international events.

Skills covered

  • Generative AI
  • AI Agents