Chaos Engineering

by Casey Rosenthal, Nora Jones
April 2020
Intermediate to advanced
305 pages
8h 45m
English
O'Reilly Media, Inc.

Chapter 19. Chaos Engineering on a Database

Why Do We Need Chaos Engineering?

Ever since Netflix open sourced Chaos Monkey in 2011, this program has become more and more popular. If you want to build a distributed system, letting Chaos Monkey go a little crazy on your cluster can help build a more fault-tolerant, robust, and reliable system.1

TiDB is an open source, distributed, Hybrid Transactional/Analytical Processing (HTAP)2 database developed primarily by PingCAP. It stores what we believe is the most important asset for any database user: the data itself. One of the fundamental and foremost requirements of our system is to be fault-tolerant. Traditionally we run unit tests and integration tests to guarantee a system is production ready, but these cover just the tip of the iceberg as clusters scale, complexity mounts, and data volumes grow by petabytes. Chaos Engineering is a natural fit for us. In this chapter, we will detail our practices and the specific reasons why a distributed system like TiDB needs Chaos Engineering.

Robustness and Stability

To build users’ trust in a newly released distributed database like TiDB, where data is saved across multiple nodes that communicate with each other, data loss or damage must be prevented at all times. But in the real world, failures can happen at any time, anywhere, in ways we can never expect. So how can we survive them? One common way is to make our system fault-tolerant. If one service crashes, another ...

ISBN: 9781492043850