Chapter 1: Data Management in the Analytics Process

Introduction

A Continuous Process

Asking Questions that Data Can Help to Answer

Sourcing Relevant Data

Reproducibility

Combining and Reconciling Multiple Sources

Identifying and Addressing Data Issues

Data Requirements Shaped by Modeling Strategies

Plan of the Book

Conclusion

References

Introduction

Although reliable estimates are difficult to come by, there seems to be consensus that data preparation—locating, assembling, reconciling, merging, cleaning, and so on—consumes something like 80% of the time required for a statistical project (Press 2016). In comparison to the literature about building statistical models and performing analysis, there are relatively few books written on the topic ...
