Chapter 7

Problem solution

Abstract

Here, we wrap up the investigation phase with a highly methodical and efficient post-mortem strategy, including proper collection and analysis of data while keeping the business prime objectives and environment criticality in mind. This chapter follows with a layered approach to isolating problems, implementing fixes, and following up with a measurable resolution, using accepted industry methods.

Keywords

data
clutter
design of experiment
statistical engineering
component search
pairwise comparison
root cause
monitoring
So far, in the previous chapters, we have learned about a range of technologies and techniques that can be helpful in problem solving in mission-critical, high-performance compute environments. ...

Get Problem-solving in High Performance Computing now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.