An outside-the-box exploration of how containers can be used to provide novel solutions.
Systems and site reliability engineers, architects, and application developers must create new strategies to meet industry shifts and their constraints.
This collection of DevOps resources will get you up to speed on the basics, best practices, and latest techniques.
The O’Reilly Podcast: Modern-day DNS for hybrid cloud, intelligent traffic steering, and DevOps.
How edge networks, Kubernetes, serverless, and other trends will shape systems engineering and operations.
Lessons learned from building engineering teams under pressure.
Kolton Andrus explores the evolution of chaos engineering and explains why it’s becoming the go-to approach for building resilient systems.
Kavya Joshi explores strategies for preparing systems for flux and scale.
Edge computing is a hot topic, but Tyler McMullen says major hurdles need to be overcome before it reaches its full potential.
Mike Strickland says a new approach to data analytics acceleration is delivering benchmarked performance increases of 3X to 10X+ at the system level for traditional relational and NoSQL databases.
Catherine Mulligan discusses the implications of blockchain on distributed systems and what needs to be addressed to build and maintain these systems effectively.
Laura Hackney looks at the pitfalls and successes of the movement to bring social justice work into the technology landscape.
Liz Rice considers the questions organizations must answer before going cloud native.
Watch highlights covering DevOps and systems engineering. From the O'Reilly Velocity Conference in London 2017.
Miriah Meyer explores how interactive visualizations can help us find meaning in mounds of data.
Christopher Meiklejohn is building his startup with Lasp, a new programming language that provides fault-tolerant, high-scalability operation.
Sara-Jane Dunn discusses an entirely different paradigm of computing: the information processing carried out by cells.
Guy Podjarny on why open source security is a community responsibility.
Learn how Netflix scales microservices with application data caching.
Learn the core principles of Google site reliability engineering.
David Woods and Richard Cook offer a glimpse at the SNAFUcatchers Stella Report.
Robert Castley explores the relevance of real-user data if real users are blocking RUM tags, and he shares some solutions.
Lara Hogan walks through tactics you can employ to be a sponsor for those around you.
Craig Adams explores the traditional DevOps pipeline, addresses how to think about CDN automation, and explains how Akamai is baking automation into its CDN.
For most of us, the best approach to scaling complex distributed systems is to not do that. So, Nick Rockwell asks, why isn’t serverless a bigger deal?
Jessica Frazelle and Dino Dai Zovi discuss how to be effective at open source in your company.
Developers spend huge amounts of time fixing bugs in their programs, but what about automatically fixing them? Claire Le Goues shares recent advances that aim to make that dream a reality.
Bitcoin showed us a new way of moving value around the internet. Neha Narula considers how this paradigm might apply to databases that cross organizational boundaries.
Carin Meier explores new ways to approach systems and tame complexity.
Matt Cutts discusses how better technology can improve not just software systems but also trust in government itself.
Joe Goldberg explores jobs as code, which looks at batch application automation from a systems development life cycle perspective.
Kristopher Beevers explains how to augment Incident Command with simple tools and processes, such as basic checklists or regular fire drills.
"Do no harm" is a core principle in medicine. Cynthia Savard Saucier challenges the tech industry to come up with its own fundamental principle.
Watch highlights covering complex distributed systems, systems engineering, DevOps, and more. From the O'Reilly Velocity Conference in New York 2017.
Rob Claire introduces the monitoring tools Pinterest uses and offers real-world examples of problem solving with data monitoring.
Building confidence in system behavior through experiments.
A hands-on demonstration for implementing tracing in modern applications that introduces tracing through the CNCF’s OpenTracing project.
Understanding gRPC in the dawn of microservices.
Five questions for Bryan Liles on the complexities of tracing, recommended tools and skills, and how to learn more about monitoring.
Understanding the impact and expanding influence of DevOps culture, and how to apply DevOps principles to make your digital operations more performant and productive.
Building and tuning traffic management for large web-scale applications.
Five questions for Brendan Burns: How containers and cluster management have changed systems development, and common patterns for building distributed systems.
Five questions for VM Brasseur and Deb Nicholson: How to overcome communication challenges and encourage collaborative work when managing globally distributed teams.
Five questions for Tanya Reilly: How service interdependencies make recovery harder and why it’s a good idea to deliberately and preemptively manage dependencies.
Five questions for Cynthia Savard Saucier on how design impacts Shopify’s business outcomes, and tips for improving designer-developer communication.
Five questions for Joseph Breuer and Robert Reta on managing dependencies, building for adaptability, and managing through change.
Kelsey Hightower discusses the benefits of using Kubernetes and Istio together.
Cliff Crocker shows how intelligent analytics connects the data from "finders" with the actions from "fixers."
Dianne Marsh asks: What does relevance in tech look like in practice?
Phillip Liu says removing friction for engineers leads to better code and a better understanding of how code works.
Dave Andrews shares strategies for addressing cascading failures at various scales and environments.
Dharma Shukla covers Azure Cosmos DB, a massively scalable, multitenant, globally distributed database service.
Tammy Everts offers a brief history of UX and web performance research, and shares educated guesses about new metrics.
Corey Scobie explains why the compartmentalization of Internet technology and application development will not sustain our aspirations.
Dawn Parzych shows how understanding assumptions and biases can help your organization.
Martin Woodward tells the story of transforming Microsoft’s internal engineering systems from a collection of disparate in-house tools to One Engineering System.
Artur Bergman looks back at the last decade of DevOps and explores shifting patterns in operations, development, and systems.
Adam Jacob says we need to design organizations that can cope with what’s new and what’s next.
Watch highlights covering distributed systems, DevOps, resiliency, and more. From the O'Reilly Velocity Conference in San Jose 2017.
Kristopher Beevers asks: What does resiliency mean when service providers are critical components of nearly every application?