Chapter 5: Cleaning and Visualizing Data

According to Anaconda's latest State of Data Science Report (https://bit.ly/3F2D8YM), 39% of your time as a data scientist will be spent on either data preparation or cleaning. This might come as no surprise, but being able to set up a problem correctly is vital to being able to get good answers from your data.

Rarely will data come to you in a perfect form, and even then, you might want to manipulate it to answer different questions from it. Being able to quickly find general statistics, discovering and removing bad columns, and altering fields in place will all be needed.

After it's in the right form, visualization is a key tool to be able to not only present your findings to those that might care about ...

Get Building Data Science Solutions with Anaconda now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.