A hands-on demonstration of implementing tracing in modern applications, introduced through the CNCF’s OpenTracing project.
Understanding gRPC in the dawn of microservices.
Five questions for Bryan Liles on the complexities of tracing, recommended tools and skills, and how to learn more about monitoring.
Understanding the impact and expanding influence of DevOps culture, and how to apply DevOps principles to make your digital operations more performant and productive.
Building and tuning traffic management for large web-scale applications.
Five questions for Brendan Burns: How containers and cluster management have changed systems development, and common patterns for building distributed systems.
Five questions for VM Brasseur and Deb Nicholson: How to overcome communication challenges and encourage collaborative work when managing globally distributed teams.
Five questions for Tanya Reilly: How service interdependencies make recovery harder and why it’s a good idea to deliberately and preemptively manage dependencies.
Five questions for Cynthia Savard Saucier on how design impacts Shopify’s business outcomes, and tips for improving designer-developer communication.
Five questions for Joseph Breuer and Robert Reta on managing dependencies, building for adaptability, and managing through change.
Dianne Marsh asks: What does relevance in tech look like in practice?
Dave Andrews shares strategies for addressing cascading failures at various scales and environments.
Tammy Everts offers a brief history of UX and web performance research, and shares educated guesses about new metrics.
Kelsey Hightower discusses the benefits of using Kubernetes and Istio together.
Phillip Liu says removing friction for engineers leads to better code and a better understanding of how code works.
Cliff Crocker shows how intelligent analytics connects the data from "finders" with the actions from "fixers."
Dharma Shukla covers Azure Cosmos DB, a massively scalable, multitenant, globally distributed database service.
Artur Bergman looks back at the last decade of DevOps and explores shifting patterns in operations, development, and systems.
Kristopher Beevers asks: What does resiliency mean when service providers are critical components of nearly every application?
Watch highlights covering distributed systems, DevOps, resiliency, and more. From the O'Reilly Velocity Conference in San Jose 2017.
Peter Alvaro explores lineage-driven fault injection (LDFI), a novel approach to automating failure testing.
Martin Woodward tells the story of transforming Microsoft’s internal engineering systems from a collection of disparate in-house tools to One Engineering System.
Dawn Parzych shows how understanding assumptions and biases can help your organization.
Adam Jacob says we need to design organizations that can cope with what’s new and what’s next.
Corey Scobie explains why the compartmentalization of Internet technology and application development will not sustain our aspirations.
How can you be an effective noncoding technical leader? Camille Fournier explores solutions to this ongoing issue.
Alex Grbic explains how a single field-programmable gate array (FPGA) can deliver acceleration for multiple workloads.
David Hayes explains why DevOps is now a requirement for success and outlines the challenges all DevOps teams will face over the next five years.
Get a clear picture of what operations professionals do, what they're paid, how they’re seen within their companies, and how they rate different aspects of their jobs.
Five questions for Dinesh Dutt on the changing relationship between network and computer.
Five questions for Laine Campbell about building dependable databases.
Five questions for Camille Fournier about the challenges engineers face when transitioning to managers, and how to foster great technical leadership.
The O’Reilly Podcast: Larry Haig discusses how to create a performance culture.
Five questions for Ben Sigelman about adopting distributed tracing tools and optimizing performance of complex, distributed systems.
Five questions for Brendan Gregg about improving the performance of Linux systems.
Bridge the operational silos between web application architecture, system administration, and teams with DC/OS.
Five questions for Lee Calcote on the scaling and performance of container architectures.
Five questions for Emil Stolarsky on moving logic to the edge of your systems.
Five questions for Gwen Shapira about how Kafka can enable business agility.
Five questions for Laura Frank about orchestration, security, and beyond.
Five questions for Sebastien Goasguen about Kubernetes and the cloud native tools that support it.
SRE calls for a unique blend of skills, which makes team building and hiring difficult. Learn how LinkedIn addressed these problems with their own SRE team.
Learn how good load balancers can distribute load, add security, and maintain performance and flexibility in web application delivery.
How to approach continuous deployment at your own organization.
Start by identifying change agents in your organization.
Learn how to avoid these common application performance testing mistakes.
Seven common mistakes that keep teams from achieving optimum mobile performance, and what to do about them.
HTTP/2 is still new and, although deploying it is relatively easy, there are a few things to be on the lookout for when enabling it.
Results from the O’Reilly Cloud Platform Survey.
What to watch for in distributed systems, SRE, serverless, containers, and more.
The focus of the O'Reilly Velocity Conference is shifting from system administration to systems engineering.
The O’Reilly Podcast: Stephen Ludin discusses how to get started with HTTP/2.
The O’Reilly Podcast: Sonia and Sabrina Burney discuss how to provide a secure and fast customer experience.
Eliminate the middleware tier and directly communicate with back-end APIs for better security, lower cost, and greater speed.
Incident management experts explain how to quickly restore service and prevent future outages.
The O’Reilly Podcast: Jason Hand discusses how to get your team started with ChatOps.
The O’Reilly Podcast: Nathan Moore discusses caching, CDNs, and scaling front end security and performance.
Mike McGarr and Andrew Spyker explain the potential containers have to help Netflix create a more productive development experience while simultaneously deepening its control over resource management.
How replacing costly mainframes with virtual assets speeds the testing process, reduces time to market, and increases agility.
Phil Stanhope shares Dyn’s experience with a major DDoS attack and explores the rapid evolution of multilayer attacks.