Chapter 1: Data Management in the Analytics Process

Introduction

A Continuous Process

Asking Questions that Data Can Help to Answer

Sourcing Relevant Data

Reproducibility

Combining and Reconciling Multiple Sources

Identifying and Addressing Data Issues

Data Requirements Shaped by Modeling Strategies

Plan of the Book

Conclusion

References

Introduction

Although reliable estimates are difficult to come by, there seems to be a consensus that data preparation (locating, assembling, reconciling, merging, cleaning, and so on) consumes roughly 80% of the time required for a statistical project (Press 2016). Compared with the literature on building statistical models and performing analysis, relatively few books have been written on the topic ...

