© Andreas François Vermeulen 2018
Andreas François VermeulenPractical Data Sciencehttps://doi.org/10.1007/978-1-4842-3054-1_8

8. Assess Superstep

Andreas François Vermeulen1 
(1)
West Kilbride North Ayrshire, UK
 

The objectives of this chapter are to show you how to assess your data science data for invalid or erroneous data values.

Caution

I have found that in 90% of my projects, I spend 70% of the processing work on this step, to improve the quality of the data used. This step will definitely improve the quality of your data science.

I urge that you spend the time to “clean up” the data before you progress to the data science, as the incorrect data entries will cause a major impact on the later steps in the process. Perform a data science project on ...

Get Practical Data Science: A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.