Introduce the parallel program execution environment
Discuss the installation of the MPI infrastructure
Define the environment for monitoring a cluster's resources
Survey common monitoring and event generation software packages
Previous chapters introduced the HSI hardware and software infrastructure required to support low-level cluster operations. This chapter covers the environment needed to run parallel jobs and cluster software, along with the system management tools required to monitor the cluster's health.
Up to this point in the book, we have discussed the lower levels of the cluster's software hierarchy: authentication, ...