Infrastructure & Ops Superstream: Platform Engineering in the Age of AI
Published by O'Reilly Media, Inc.
Bridge the gap between AI innovation and platform reliability
AI is fundamentally transforming platform engineering—platform teams must now support entirely new workloads with unique infrastructure needs, unpredictable costs, and novel security concerns, while developers increasingly expect AI capabilities to be as accessible as traditional cloud services.
Join Sam Newman and a panel of experts as they discuss how platform engineers can evolve their IDPs to meet these demands without becoming bottlenecks to AI innovation. You’ll pick up practical strategies for making AI development accessible across your organizations while maintaining the operational excellence, security, and cost control that platform engineering promises.
What you’ll learn and how you can apply it
- How to keep your organization running smoothly without slowing your teams down
- Understand what good looks like with a successful platform build
- The influence platforms and AI are having on the SDLC
Recommended follow-up:
- Read Platform Engineering (book)
- Take Agentic AI in Platform Engineering (live online course with Ajay Chankramath)
Schedule
The time frames are only estimates and may vary according to how the class is progressing.
Sam Newman: Introduction (5 minutes)
Sam Newman welcomes you to the Infrastructure & Ops Superstream.
Platform Engineering for Agentic AI – Abdel Sghiouar (35 minutes)
Agentic AI isn't just a buzzword; it's a fundamental shift in software. These intelligent, goal-driven applications operate autonomously, presenting unprecedented challenges for infrastructure teams. The old ways of building and running apps simply won’t scale. Join Abdel Sghiouar, developer advocate at Google Cloud, to explore how the power of Kubernetes combined with modern platform engineering principles can create a scalable, resilient, and observable foundation for your next-generation intelligent applications.
Building an AI Agent: From Internal Tool to Platform Advantage – Jordan Lewis (Sponsored by Cockroach Labs) (30 minutes)
How do you operationalize AI tooling so that it’s reliable enough for daily use, secure enough to trust with internal data, cost-efficient enough to scale, and accessible enough that nontechnical teams can actually benefit from it? Jordan Lewis, VP of engineering at Cockroach Labs, shares his experience designing and deploying Mica, an internal AI agent that brings the power of LLMs to engineering, knowledge work, and enterprise-ready business workflows. He’ll take you through decisions that shaped Mica’s platform architecture, from managing LLM inference costs and latency to building guardrails that keep sensitive information internal, as well as how using a distributed database like CockroachDB made the process easier. Get a perspective on what operational excellence, security, compliance, centralized telemetry, and cost control look like when AI moves from proof of concept to production platform at the enterprise level.
This session will be followed by a 30-minute Q&A in a breakout room. Stop by if you have more questions for Jordan.
Break (5 minutes)
Platform Engineering for Developers, Architects & the Rest of Us – Daniel Bryant (35 minutes)
Join Daniel Bryant to explore the core goals of platform engineering for the software developer and architect communities. Understand “what good looks like” with a successful platform build and how a platform can influence the SDLC (for better or worse!).
Governance Without The Red Tape – Sarah Wells (35 minutes)
When you hear “governance,” you might think of red tape, bureaucracy, or someone telling you what you can’t do. But real governance is about alignment and reducing technical risk. And that matters more than ever. Join Sarah Wells to understand how to reduce risk, improve decision-making, and keep your organization running smoothly—without slowing your teams down.
Break (5 minutes)
The Fourth Signal: Why Evals Belong in Your Observability Stack – Ben O’Mahony (35 minutes)
Evals are the correctness signal that OTel doesn’t have. They’re not a test tool, they're the fourth pillar alongside logs, metrics, traces. Pass rate over time is your SLO. Error budget burn is eval regression. Join principal AI engineer Ben O’Mahony to understand why evals belong in your observability stack.
Fireside Chat with Nathen Harvey (35 minutes)
Join Sam Newman and Nathen Harvey, DORA team lead at Google Cloud, for this fireside chat around the role of platform engineering in the era of AI. Topics might include why AI necessitates a platform model, or how a strong internal platform can help unlock the impact of AI. Come prepared with questions to ask Nathen and Sam.
Sam Newman: Closing Remarks (5 minutes)
Sam Newman closes out today’s event.
Your Hosts and Selected Speakers
Sam Newman
Sam Newman is a technologist focusing on the areas of cloud, microservices, and continuous delivery—three topics which seem to overlap frequently. He provides consulting, training, and advisory services to startups and large multinational enterprises alike, drawing on his more than 20 years in IT as a developer, sysadmin, and architect. Sam is the author of the best-selling Building Microservices and Monolith to Microservices, both from O’Reilly, and is also an experienced conference speaker.
Abdel Sghiouar
Abdel Sghiouar is a senior cloud developer advocate at Google Cloud, cohost of the Kubernetes Podcast from Google, and a CNCF Ambassador. His areas of focus are GKE/Kubernetes, service mesh, and serverless. Abdel started his career in data centers and infrastructure in Morocco, his home country, before moving to Google’s largest EU data center in Belgium. He subsequently joined Google Cloud Professional Services in Sweden and spent five years working with Google Cloud customers to architect and design large-scale distributed systems before turning to advocacy and community work.
Jordan Lewis
Jordan Lewis is the VP of engineering at Cockroach Labs, joining in 2016 as a software engineer and employee #25. He is the creator of Mica, Cockroach Labs’ internal agent that gives employees a unified, trusted interface across workplace tools and services that is fundamentally changing how the company operates. Jordan is a New York native and passionate about database technologies.
Daniel Bryant
Daniel Bryant leads product marketing at Syntasso, where he helps platform teams understand and adopt modern internal developer platforms. With a background as a software architect and tool maker at companies like OpenCredo and Ambassador Labs, Daniel brings a unique blend of technical insight and strategic storytelling. He’s a long-time InfoQ editor, conference speaker, and coauthor of Mastering API Architecture (O’Reilly).
Sarah Wells
Sarah Wells is a technology leader, consultant, and conference speaker with a focus on microservices, engineering enablement, observability, and DevOps. She has over 20 years of experience as a developer, principal engineer, and tech director across product, platform, SRE, and DevOps teams. She spent over a decade working at the Financial Times as it transitioned from 12 releases a year to more than 20,000 and adopted the cloud, microservices, and DevOps.
Ben O'Mahony
Ben O’Mahony is Principal AI Engineer at Thoughtworks. He is a results-driven AI/Engineering leader with a track record of building high-performing teams and shipping business-critical AI, ML and data products and platforms at scale. He has deep expertise across the full Engineering and Data lifecycle from research to production deployment. Ben is adept at defining technical strategy, driving execution and partnering cross-functionally to deliver measurable impact. Recently Ben has been intensely focused on building Generative AI platforms, models and agents.
Nathen Harvey
Nathen Harvey leads the DORA team at Google Cloud. He leverages industry-shaping research to drive product strategy and help organizations improve software delivery speed, stability, and the developer experience. A frequent speaker on DevOps and AI, Nathen is dedicated to building solutions that empower technical communities. He co-authored multiple DORA reports and contributed to 97 Things Every Cloud Engineer Should Know, published by O’Reilly in 2020.
