O'Reilly logo

IBM High Performance Computing Cluster Health Check by Fernando Pizzano, Thorsten Nitsch, Justin I. Morosi, Herbert Mehlhose, Markus Hilger, Jie Gong, Rico Franke, Murali Dhandapani, Manmohan Brahma, Shivendra Ashish, Ross Aiken, Dino Quintero

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Introduction
This IBM Redbooks publication mainly provides information about IBM High Performance Computing (HPC) clusters. Therefore, in the rest part of this book, unless otherwise mentioned, the term cluster represents an IBM HPC cluster. A computer cluster is a group of connected computers that work together. In many respects, they act as a single system.
We describe the concepts, processes, and methodologies used to achieve and maintain a “healthy” state for an IBM HPC system, both in pre-production stage and production stage.In the context ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required