Distributed Systems, 2nd Edition

by Sukumar Ghosh
July 2014
Intermediate to advanced
554 pages
17h 49m
English
Chapman and Hall/CRC

Chapter 12

Fault-Tolerant Systems

12.1 Introduction

A fault is the manifestation of an unexpected behavior, and fault tolerance is a mechanism that masks the fault or restores the expected behavior of a system after faults occur. Attention to fault tolerance, or dependability, has increased sharply in recent years because we depend more and more on computers to perform critical as well as noncritical tasks. The growing scale of such systems also contributes indirectly to the rising number of faults. Advances in hardware engineering can make individual components more dependable, but they cannot eliminate faults altogether. Poor system design and behavioral patterns such as mobility can also contribute to failures.

Historically, ...



ISBN: 9781466552975