Distributed Systems, 2nd Edition

by Sukumar Ghosh
July 2014
Intermediate to advanced
554 pages
17h 49m
English
Chapman and Hall/CRC
Content preview from Distributed Systems, 2nd Edition

Chapter 17

Self-Stabilizing Systems

17.1 Introduction

In large-scale distributed systems, failures and perturbations are expected events, not catastrophic exceptions. External intervention to restore normal operation or to configure the system is difficult, and it will only become more difficult in the future. Therefore, the means of recovery have to be built into the system.

Fault-tolerance techniques can be divided into two broad classes: masking and nonmasking. Certain types of applications call for masking tolerance, where the effect of a failure is completely invisible to the application; these include safety-critical systems, some real-time systems, and certain sensitive database applications in the financial world. For others, nonmasking tolerance ...



Publisher Resources

ISBN: 9781466552975