Chapter 9: Data Exploration: Visual and Automated Tools to Detect Problems

Introduction

Common Issues to Anticipate

On the Hunt for Dirty Data

Distribution

Columns Viewer

Multivariate (Correlations and Scatterplot Matrix)

More Tools within the Multivariate Platform

Principal Components

Outlier Analysis

Item Reliability

Explore Outliers

Quantile Range Outliers

Robust Fit Outliers

Multivariate Robust Outliers

Multivariate k-Nearest Neighbors Outliers

Explore Missing

Conclusion

References

Introduction

In Part II of this book, we’ve discussed many of the issues that arise on the path from disparate raw data sources to a consolidated JMP data table. Earlier chapters have discussed issues such as adjusting data types or modeling types “on the way into” ...
