Chapter 7

Problem solution

Abstract

Here, we wrap up the investigation phase with a highly methodical and efficient post-mortem strategy, including proper collection and analysis of data while keeping the business prime objectives and environment criticality in mind. This chapter follows with a layered approach to isolating problems, implementing fixes, and following up with a measurable resolution, using accepted industry methods.

Keywords

data
clutter
design of experiment
statistical engineering
component search
pairwise comparison
root cause
monitoring
So far, in the previous chapters, we have learned about a range of technologies and techniques that can be helpful in problem solving in mission-critical, high-performance compute environments. ...

Get Problem-solving in High Performance Computing now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.