Break me if you can: A practical guide to building fault-tolerant systems

Video description

You built your system, you deployed it, you rolled it up in production, but it’s just the beginning. The life of your system just started. It will grow, evolve, and wake you up in the middle of the night. Usually, at this point you start thinking about fault tolerance and error handling. Fault-tolerance concepts sound simple: modern frameworks promise to effortlessly solve it for you.

But what’s hiding behind the simplicity? Alex Borysov (Google) and Mykyta Protsenko (Netflix) take you along for a sneak peak at how to design and build truly fault-tolerant Java systems. They make it real by trying failure scenarios against a live system (you’ll watch it recover in real time) and then review the recipes (with gRPC and REST examples and a number of open source tools) that you can use right away to make your code more resilient and your system more robust.

Prerequisite knowledge

  • A basic understanding of microservice architecture

This session was recorded at the 2019 O'Reilly Open Source Software conference in Portland.

Product information

  • Title: Break me if you can: A practical guide to building fault-tolerant systems
  • Author(s): Alex Borysov, Mykyta Protsenko
  • Release date: December 2019
  • Publisher(s): O'Reilly Media, Inc.
  • ISBN: 0636920335542