Fault Detection Tools

This section describes products that can be used to detect application failures. Additional tools for monitoring application performance are discussed later in the chapter.


IT/Operations (IT/O) is a sophisticated management product for system operators. It displays the systems in the enterprise on onscreen maps, with colors representing their current status. An Application Bank holds default and customized tools that can be launched from a particular system. Systems throughout the enterprise can send events, and configured recovery actions can then be taken in response.
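The event-and-recovery model described above can be illustrated with a small sketch. This is not IT/O's interface or configuration format; it is a generic Python illustration, and every name in it (`Event`, `EventConsole`, `configure_action`, `restart_daemon`, the host names, and the event kinds) is hypothetical. It only shows the general idea of a central console that records each system's latest status and runs a preconfigured action when a matching event arrives.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Event:
    """A hypothetical event reported by a managed system."""
    host: str
    kind: str
    severity: str


class EventConsole:
    """Hypothetical central console: tracks status and dispatches actions."""

    def __init__(self) -> None:
        self._actions: Dict[str, Callable[[Event], str]] = {}
        self.status: Dict[str, str] = {}   # latest severity seen per host
        self.log: List[str] = []           # results of actions that ran

    def configure_action(self, kind: str, action: Callable[[Event], str]) -> None:
        """Associate a recovery action with an event kind."""
        self._actions[kind] = action

    def receive(self, event: Event) -> Optional[str]:
        """Record the event's severity and run the configured action, if any."""
        self.status[event.host] = event.severity
        action = self._actions.get(event.kind)
        if action is None:
            return None  # no recovery action configured for this kind
        result = action(event)
        self.log.append(result)
        return result


def restart_daemon(event: Event) -> str:
    """Placeholder recovery action; a real one would act on the host."""
    return f"restart requested on {event.host}"


console = EventConsole()
console.configure_action("daemon_down", restart_daemon)
outcome = console.receive(Event("db01", "daemon_down", "critical"))
```

In this sketch the status dictionary plays the role of the colored map: an operator view could read the latest severity for each host from it, while events with no configured action are recorded but otherwise left for the operator.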

IT/O can also be used to monitor specified components on a managed system. Some predefined monitors are provided, one of which has the ...
