Python Automation Cookbook - Second Edition

by Jaime Buelta
May 2020
Content level: Intermediate to advanced
526 pages
10h 31m
English
Packt Publishing

7

Cleaning and Processing Data

Some automated tasks require dealing with large amounts of data. As the data grows, two distinct problems appear: processing takes too long, and quality issues in the input data cause more errors.

Both problems are well known in data science, where large quantities of data are the norm, but they can appear even at a smaller scale.

The quality of input data is closely related to the number of sources it comes from. In general, data from a single source is more consistent, but relying on a single source is limiting. Even data that comes from the same source can still contain inconsistencies or errors.

Some examples of differences could be regional, such as date formats or currencies, ...
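
As an illustration of the date format differences mentioned above, here is a minimal sketch that tries a few candidate formats in turn and normalizes whatever matches into a `date` object. The `KNOWN_FORMATS` tuple, the `normalize_date` helper, and the sample rows are assumptions made for this example, not code from the book.

```python
from datetime import date, datetime

# Hypothetical set of formats expected in the input. Order matters:
# an ambiguous value is parsed with the first format that matches.
KNOWN_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%m-%d-%Y")


def normalize_date(text, formats=KNOWN_FORMATS):
    """Return a date parsed with the first matching format, or None."""
    for fmt in formats:
        try:
            return datetime.strptime(text.strip(), fmt).date()
        except ValueError:
            # This format did not match; try the next one.
            continue
    # No known format matched, so flag the value as unusable.
    return None


# Sample input mixing ISO, day-first, and month-first styles.
rows = ["2020-05-29", "29/05/2020", "05-29-2020", "not a date"]
cleaned = [normalize_date(row) for row in rows]
```

Returning `None` instead of raising keeps a batch job running and leaves the bad rows easy to count or log afterwards. Note that a value such as `03/04/2020` is inherently ambiguous; this sketch resolves it by format order, which may not suit every data source.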


Publisher Resources

ISBN: 9781800207080