Get a basic understanding of distributed systems and then go deeper with recommended resources.
Martin Kleppmann shows how recent computer science research is helping develop the abstractions and APIs for the next generation of applications.
Claire Janisch looks at some of the best biomimicry opportunities inspired by nature’s software and wetware.
Kris Nova looks at the new era of the cloud native space and the kernel that has made it all possible: Kubernetes.
Jane Adams examines the ways data-driven recruiting fails to achieve intended results and perpetuates discriminatory hiring practices.
Crystal Hirschorn discusses how organizations can benefit from combining established tech practices with incident planning, post-mortem-driven development, chaos engineering, and observability.
Watch highlights from expert talks covering Kubernetes, chaos engineering, deep learning, and more.
Omoju Miller outlines a vision where we harness human action for a better future.
Anne Currie says excessive and dirty energy use in data centers is one of the biggest ethical issues facing the tech industry.
Katrina Owen says the valuable skills that experienced professionals lack are at the vital margins of their careers.
O’Reilly’s new survey reveals the latest operations salary trends, and the skill sets that will keep your operations career on track.
This collection of serverless resources will get you up to speed on the basics and best practices.
A new report examines the state of infrastructure and anticipated near-term developments through the eyes of infrastructure experts.
Michael Bernstein offers an unflinching look at some of the fallacies that developers believe about marketing.
Roger Magoulas shares insights from O'Reilly's online learning platform that point toward shifts in the systems engineering ecosystem.
Tammy Butow explains how companies can use Chaos Days to focus on controlled chaos engineering.
Laura Thomson shares Mozilla’s approach to data ethics, review, and stewardship.
Jaana Dogan explains why Google teaches its tracing tools to new employees and how it helps them learn about Google-scale systems end to end.
Anil Dash asks: How could our processes and tools be designed to undo the biggest bugs and biases of today’s tech?
Francesc Campoy Flores explores ways machine learning can help developers be more efficient.
Kris Beevers examines the trade-offs between risk and velocity faced by any high-growth, critical path technology business.
Jessica McKellar draws parallels between the free and open source software movement and the work to end mass incarceration.
Dave Rensin explains why DevOps and SRE make each other better.
Laurent Gil shares the latest cybersecurity research findings based on real-world security operations.
Watch highlights from expert talks covering DevOps, SRE, security, machine learning, and more.
Kavya Joshi says performance theory offers a rigorous and practical approach to performance tuning and capacity planning.
Using advanced Docker Compose features to solve problems in larger projects and teams.
Poll results reveal where and why organizations choose to use containers, cloud platforms, and data pipelines.
Get a basic understanding of site reliability engineering (SRE) and then go deeper with recommended resources.
Achieve high-impact systems monitoring by focusing on latency, errors, throughput, utilization, and blackbox monitoring.
Get advice and insight from speakers who have tackled the challenges you face.
O’Reilly Media Podcast: George Miranda discusses the benefits and challenges of a service mesh, and the best ways to get started using one.
Learn why this new tool is a critical component in microservice-based architectures.
David Hayes explains why adding a manageable dose of actionable intelligence to your operations management workflow can save you time and aggravation.
Kyle Kingsbury explores anomalies in three distributed systems and shares strategies for correctness testing using Jepsen.
Bryan Liles explains how to evaluate and integrate new declarative application management practices into continuous integration pipelines.
Julia Grace shares how she learned to rapidly scale herself and her leadership team during a period of hypergrowth at Slack.
Dave Andrews explains how to wield the power of a global 50 Tbps application delivery network to ensure maximum availability during and after a change.
Nicole Forsgren shares results and stories behind high-performing technology-driven teams and organizations.
Oracle's Kyle York and Netra's Richard Lee discuss Netra’s high-performance computing environment.
Renee Orser explains how to monitor the human networks within your engineering teams using models similar to your distributed technology systems.
Astrid Atkinson discusses techniques for building systems that are resilient by design.
Kyle York explores the scale, complexity, and volatility of the internet and the risk it poses to your applications and infrastructure.
Martin Woodward shares key data points from Microsoft's journey to DevOps.
Javier Garza details the ingredients you need to build and deliver an app your users will love.
Tamar Bercovici details how the team at Box has constructed its database stack to handle an ever-growing query load and data set.
Kris Nova looks at the four metrics that help you decide if running stateful applications in Kubernetes is worth the risk.
Watch highlights covering infrastructure, DevOps, security, and more. From the O'Reilly Velocity Conference in San Jose 2018.
Natalie Silvanovich discusses the link between feature complexity, developer error, and security vulnerabilities.
Recipes that deal with various aspects of troubleshooting, from debugging pods and containers, to testing service connectivity, interpreting a resource’s status, and node maintenance.
The O'Reilly Velocity Conference in San Jose will cover what you need to know to build high-performance, resilient, and secure systems.
The O’Reilly Fluent and Velocity conferences are teaming up to create a unique learning opportunity that addresses the full web experience.
An outside-the-box exploration of how containers can be used to provide novel solutions.
Systems and site reliability engineers, architects, and application developers must create new strategies to meet industry shifts and their constraints.
This collection of DevOps resources will get you up to speed on the basics, best practices, and latest techniques.
The O’Reilly Podcast: Modern day DNS for hybrid cloud, intelligent traffic steering, and DevOps.
How edge networks, Kubernetes, serverless and other trends will shape systems engineering and operations.
Lessons learned from building engineering teams under pressure.
Mike Strickland says a new approach to data analytics acceleration is delivering benchmarked performance increases of 3X to 10X+ at the system level for traditional relational and NoSQL databases.
Kolton Andrus explores the evolution of chaos engineering and explains why it’s becoming the go-to approach for building resilient systems.