The objectives of this chapter are to show you how to assess your data science data for invalid or erroneous data values.
Caution
I have found that in 90% of my projects, I spend 70% of the processing work on this step, to improve the quality of the data used. This step will definitely improve the quality of your data science.
I urge that you spend the time to “clean up” the data before you progress to the data science, as the incorrect data entries will cause a major impact on the later steps in the process. Perform a data science project on ...