Skip to Content
Chaos debugging: Finding and fixing microservice abnormalities
conference

Chaos debugging: Finding and fixing microservice abnormalities

by Mitchell Kelley, Scott Cranton
December 2019
Intermediate
39m
English
O'Reilly Media, Inc.
Closed Captioning available in German, English, Spanish, French, Japanese, Korean, Portuguese (Portugal, Brazil), Chinese (Simplified), Chinese (Traditional)

Overview

Building microservices applications introduces more complexity into your architecture. Highly distributed applications on elastic, ephemeral infrastructure that communicate heavily over the network make for an environment where an application is always in a fluid, partially failing state at all times. To help developers transition from the monolithic way of designing and building software to a more service-oriented approach, we need to bridge the gap in tooling to help diagnose and understand what a normal state looks like and how to recover from a non-normal state.

Mitchell Kelley and Scott Cranton (solo.io) discuss the types of failures that can occur, namely networking, application behavior/code, and storage, and present a systemic workflow for prodding and exploring a system to detect faults and abnormal behavior. This framework builds on the practices known as chaos engineering. Mitchell and Scott take a look at two open source projects that aim to complement this workflow: the Squash project, for step-by-step distributed microservices debugging, and Gloo Shot, a newly created chaos engineering framework.

Prerequisite knowledge

  • Familiarity with debugging applications and building distributed applications
  • A basic understanding of services-oriented applications

What you'll learn

  • Understand chaos engineering
  • Learn how to debug distributed applications with appropriate tooling
  • Investigate the requisite workflow to apply toward system behavior exploration

This session was recorded at the 2019 O'Reilly Open Source Software conference in Portland.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Watch now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Distributed event-driven services: From the trenches

Distributed event-driven services: From the trenches

Premanand Chandrasekaran

Publisher Resources

ISBN: 0636920335689