Chapter 17. Let’s Get Cyber-Physical
Much of the attention around Chaos Engineering currently focuses on the creation, deployment, and management of infrastructure and applications that underpin various internet-centric products and services, such as Netflix, AWS, and Facebook. These are domains where innovative, large-scale, complex, and interconnected software-driven systems dominate the landscape.
The goal of Chaos Engineering in these IT and cloud-dominated ecosystems is the discovery and removal of questions about system behavior in the pursuit of achieving more reliable and robust systems. By applying Chaos Engineering practices to complex systems you’re attempting to learn where you have blind spots. Sometimes this takes the form of discovering you can’t actually detect when something has gone wrong. Other times you are discovering that your assumptions about your system’s behavior don’t hold up at all. You may find you are reverse-engineering your implementation’s real model and finding out just how far away it is from the design you imagined.
In this chapter we will explore four topics:
-
Facets of traditional Functional Safety practice
-
Where there is overlap between Functional Safety and Chaos Engineering (e.g., performing Failure Mode and Effects Analysis)
-
Where Functional Safety practices leave a lot of room for improvement with respect to the new generations of software-intensive systems currently being developed
-
Where Chaos Engineering principles ...