Get a clear picture of what operations professionals do, what they're paid, how they’re seen within their companies, and how they rate different aspects of their jobs.
Five Questions for Camille Fournier about the challenges engineers face when transitioning to managers, and how to foster great technical leadership.
Watch highlights covering distributed systems, DevOps, resiliency, and more. From the O'Reilly Velocity Conference in San Jose 2017.
What to watch for in distributed systems, SRE, serverless, containers and more.
SRE calls for a unique blend of skills, which makes team building and hiring difficult. Learn how LinkedIn addressed these problems with their own SRE team.
Five Questions for Laine Campbell about building dependable databases.
Site Reliability Engineer is a job title we are starting to see more and more these days. What does it mean? Where does it come from? Learn from Google's SRE team.
A case study in how Google monitors its complex systems.
Join Safari. Get a free trial today and find answers on the fly, or master something new and useful.
Phil Stanhope shares Dyn’s experience with a major DDoS attack and explores the rapid evolution of multilayer attacks.
Learn how to analyze operations data in the presence of “holes” in the time series, how missing data impacts analysis, and a gamut of techniques that can be used to address the missing data issue.
Learn how Datadog integrated Consul into its environment. Darron Froese outlines mistakes made and lessons learned, plus tips for successful implementation in your own environment.
Learn how to use Kubernetes and Prometheus together to reimagine infrastructure and measure "the right things."
Learn the capabilities of Ansible, an agentless and extensible configuration management platform.
New York, NY
Building and maintaining complex distributed systems
Jez Humble is co-author of Lean Enterprise and Continuous Delivery (Addison-Wesley), the Jolt Award-winning book in Martin Fowler's signature series. ...