6 Clean and prepare

This chapter covers

  • Understanding the types of errors that you might find in your data
  • Identifying problems in your data
  • Implementing strategies for fixing or working around bad data
  • Preparing your data for effective use in production

When we’re working with data, it’s crucial that we can trust our data and work with it effectively. Almost every data-wrangling project is front-loaded with an effort to fix problems and prepare the data for use.

You may have heard that cleanup and preparation equal 80% of the work! I’m not sure about that, but certainly preparation is often a large proportion of the total work.

Time invested at this stage helps save us from later discovering that we’ve been working with unreliable or ...

Get Data Wrangling with JavaScript now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.