© Harry J. Foxwell 2020
H. J. FoxwellCreating Good Datahttps://doi.org/10.1007/978-1-4842-6103-3_8

8. Cleaning Your Data

Harry J. Foxwell1 
(1)
Fairfax, VA, USA
 

Garbage in, garbage out.

—George Fuechsel [1]

Okay, so you’ve read this book, learned how to create good data for your projects, and now all your new datasets are squeaky clean and ready for analysis! However, your research colleague borrowed the book but didn’t read it and now has a collection of messy datasets. Now what? Well, next we’ll learn about some methods for detecting bad data and for cleaning it up, often referred to as a component of “data munging” or “data wrangling.”

The goal of data cleaning is to get the data ready for analysis. But how will you know when it is ready? You need ...

Get Creating Good Data: A Guide to Dataset Structure and Data Representation now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.