Chapter 9: Data Exploration: Visual and Automated Tools to Detect Problems

Introduction

Common Issues to Anticipate

On the Hunt for Dirty Data

Distribution

Columns Viewer

Multivariate (Correlations and Scatterplot Matrix)

More Tools within the Multivariate Platform

Principal Components

Outlier Analysis

Item Reliability

Explore Outliers

Quantile Range Outliers

Robust Fit Outliers

Multivariate Robust Outliers

Multivariate k-Nearest Neighbors Outliers

Explore Missing

Conclusion

References

Introduction

In Part II of this book, we’ve discussed many of the issues that arise on the path from disparate raw data sources to a consolidated JMP data table. Earlier chapters have discussed issues such as adjusting data types or modeling types “on the way into” ...

Get Preparing Data for Analysis with JMP now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.