Skip to Content
View all events

Advanced Harness Engineering

Published by O'Reilly Media, Inc.

Advanced content levelAdvanced

Design and build agentic workflow and deep research harnesses

What you’ll learn and how you can apply it

  • Design a durable workflow harness that combines checkpointing, idempotent task execution, approval gates, and auditable execution logs
  • Build a workflow agent in LangGraph that persists state, resumes from checkpoints after a simulated failure, and uses procedural and semantic memory for SOPs and business rules
  • Design a deep research harness with an orchestrator-worker architecture, a shared evidence pool, and a memory consolidation pipeline
  • Build a parallel multi-agent deep research system that accumulates knowledge across sessions and demonstrates measurable improvement session over session

Course description

Workflow agents and deep research agents are two of the hardest application modes to get right in production. Workflow agents need to run for minutes or hours, survive failures, persist state across interruptions, and produce audit trails that hold up under review. Deep research agents need to coordinate parallel subagents, share evidence across a growing knowledge base, consolidate findings into durable knowledge, and resolve contradictions before they compound. The harness requirements for each are distinct, and the architectural choices you make early tend to determine whether the system can scale or quietly collapse as tasks get longer.

This two-hour course with Richmond Alake applies a memory-first, component-driven approach to the workflow and deep research application modes. You’ll design a durable workflow harness using LangGraph-style checkpointing, encode SOPs and business rules into the appropriate memory types, and build approval gates that keep a human in the loop without breaking determinism. You’ll then design a parallel, multi-agent deep research harness with an orchestrator and worker agents, a shared evidence pool, and a memory-consolidation pipeline that turns episodic traces into durable semantic knowledge.

This live event is for you because...

  • You’re a software engineer, ML engineer, or AI engineer who has built agent prototypes and is taking on longer-running or higher-stakes workloads.
  • You’re a technical lead or architect designing workflow automation or research tooling that needs to run reliably over hours or days.
  • You work with LangGraph, CrewAI, LlamaIndex, the Claude Agent SDK, or similar frameworks and want to apply harness engineering principles to multi-agent systems.
  • You want to move beyond single-shot agents and build systems that run reliably over long horizons.

Prerequisites

  • A Python 3.11+ environment with a recent agent framework installed (LangChain, LangGraph, or the Claude Agent SDK)
  • An API key for at least one frontier model provider (Anthropic, OpenAI, or equivalent)
  • A local database available for memory persistence (Oracle AI Database)
  • A code editor with notebook support, and Git installed for cloning the course repository (shared before the session)
  • Working knowledge of Python, including async patterns and basic API development
  • Familiarity with LLM APIs and prompt-level agent behavior (tool calling, system prompts, structured outputs)
  • Familiarity with at least one agent framework (LangGraph preferred)
  • Comfort with graph-based state management (helpful but not required)
  • Exposure to vector search and semantic retrieval concepts

Recommended follow-up:

Schedule

The time frames are only estimates and may vary according to how the class is progressing.

Building the workflow mode agent harness (50 minutes)

  • Presentation: Why workflow mode needs durable execution; how checkpointing differs from runtime-managed durability; when sequential, orchestrator-worker, or parallel architectures fit; encoding SOPs and business rules into procedural and semantic memory; treating step outputs as checkpoint writes that enable resumability; sandbox design for workflows, covering idempotent tasks, deterministic replay, tool scope boundaries, and approval gates with auditable logs built on episodic memory
  • Hands-on exercise: Build a compliance reporting workflow with LangGraph checkpointing, a human approval gate, and the four memory types wired in; trigger a mid-pipeline failure and resume from the last persisted checkpoint
  • Break

Building the deep research mode agent harness (50 minutes)

  • Presentation: Why deep research needs a parallel multi-agent architecture; when multi-agent systems help and when they amplify errors; designing the shared evidence pool as a memory substrate, covering provenance, deduplication, and contradiction detection; memory consolidation pipelines that turn episodic traces into durable semantic knowledge; context engineering across a growing knowledge store, recursive subtask decomposition, semantic caching for search and synthesis, and human-in-the-loop steering for research direction
  • Hands-on exercise: Build a competitive intelligence harness with an orchestrator agent, parallel worker agents, a shared evidence pool, and a consolidation pipeline; run two sessions back to back and measure how first-session findings improve the second

Wrap-up and Q&A (10 minutes)

Your Instructor

  • Richmond Alake

    Richmond Alake is a highly experienced Machine Learning Architect and Engineer with over five years of expertise in the field. He specializes in Computer Vision and Deep Learning and has a proven track record of successfully developing and integrating deep learning models to solve a wide range of problems, such as motion detection, object detection, and pose estimation. Throughout his career, he has worked with a diverse range of clients, including large conglomerates, financial institutions, and small startups. In addition to his professional work, Richmond also serves as an AI advisor to a number of startups in the UK and the US.

    With a background in building websites and mobile applications, Richmond is a firm believer in using technology to solve everyday problems. He has extensive knowledge of Machine Learning and has written over 200 articles on the subject, gaining over a million views. He was recognized as one of Medium's top AI writers in 2020/2021 and has collaborated with companies such as O'Reilly, BuiltIn and Nvidia to develop effective educational and informative learning materials on AI.

    Currently, Richmond Alake is a Machine Learning Architect at Slalom Build UK. As the first hire of the machine learning practice in the UK division, he is responsible for helping organizations move from machine learning research to productionisation and assisting maturing organizations in promoting AI models into existing infrastructure to drive commercial and business value. His main role as an ML Architect is to assist organizations in developing and maintaining machine learning pipelines by implementing MLOps principles, techniques, and tooling. He is well-versed in Feature Stores and has conducted internal training for Data Engineers, Data Scientists, and ML Engineers.

    linkedinXlinksearch

Skill covered

Engineering