IBM High Performance Computing Cluster Health Check by Fernando Pizzano, Thorsten Nitsch, Justin I. Morosi, Herbert Mehlhose, Markus Hilger, Jie Gong, Rico Franke, Murali Dhandapani, Manmohan Brahma, Shivendra Ashish, Ross Aiken, Dino Quintero

Toolkits for verifying health (individual diagnostics)
To determine the health of a cluster, you need the current state of each of its components. In most installations, these are:
Compute nodes
Ethernet network
InfiniBand network
Storage
In this chapter, we describe the IBM Cluster Health Check (CHC) toolkit, which is used to perform checks on these components. Based on the results of these checks, we can state whether the cluster is healthy.
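The idea of combining per-component results into a single verdict can be illustrated with a minimal sketch. This is not the CHC toolkit's interface: the names Status, COMPONENTS, cluster_health, and is_healthy are hypothetical, and the rule that the worst component result decides the overall state is an assumption made for illustration.

```python
from enum import Enum


class Status(Enum):
    """Hypothetical result levels for a single component check."""
    OK = 0
    WARNING = 1
    FAILED = 2


# The four component groups named in the text (identifiers are illustrative).
COMPONENTS = ("compute_nodes", "ethernet", "infiniband", "storage")


def cluster_health(results):
    """Return the worst status across all components.

    Assumption: the cluster is only as healthy as its least healthy
    component, and a component without a result is treated as an error.
    """
    missing = [name for name in COMPONENTS if name not in results]
    if missing:
        raise ValueError("no check result for: " + ", ".join(missing))
    return max((results[name] for name in COMPONENTS), key=lambda s: s.value)


def is_healthy(results):
    """The cluster counts as healthy only if every component is OK."""
    return cluster_health(results) is Status.OK


if __name__ == "__main__":
    # Example input; in practice these values would come from real checks.
    results = {name: Status.OK for name in COMPONENTS}
    results["infiniband"] = Status.WARNING
    print(cluster_health(results).name, is_healthy(results))
```

Requiring a result for every component reflects the point made above: the verdict depends on the state of all components, so a missing result is reported as an error and is not assumed to be healthy.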
This chapter provides information about the following topics:
