Errata

Practical Python Data Wrangling and Data Quality

The errata list records errors found after the product was released, together with their corrections.

The following errata were submitted by our customers and have not yet been approved or disproved by the author or editor. They solely represent the opinion of the customer.

Version Location Description Submitted by Date submitted
chapter 7
https://learning.oreilly.com/library/view/practical-python-data/9781492091493/ch07.html#:-:text=2020-09-01%2000%3A08%3A17.5150

"2020-09-01 00:08:17.5150" should be "2020-09-01 08:17:00.5150"

Tobias Schreiner  Oct 25, 2022 
chapter 7
https://learning.oreilly.com/library/view/practical-python-data/9781492091493/ch07.html#:-:text=output_file%20%3D%20open(%222020-09-01-citibike-tripdata.csv%22%2C%22w%22)

On Windows, I ran into this issue: https://stackoverflow.com/questions/3348460/csv-file-written-with-python-has-blank-lines-between-each-row - it might be worth mentioning in the book.

Tobias Schreiner  Oct 26, 2022 
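
Note on the entry above: the blank-row behavior on Windows is usually avoided by passing newline="" when opening a file for the csv module, as Python's csv documentation recommends. The sketch below is a minimal illustration under that assumption; the rows and the output file name are hypothetical and are not taken from the book.

```python
import csv
import os
import tempfile

# Hypothetical rows, not taken from the book's dataset.
rows = [["ride_id", "started_at"], ["A1", "2020-09-01 00:08:17.5150"]]

# Write to a temporary directory so the sketch has no side effects.
output_path = os.path.join(tempfile.mkdtemp(), "example-output.csv")

# newline="" stops Python from translating the "\r\n" that csv.writer
# emits into "\r\r\n" on Windows, which spreadsheet programs show as
# a blank row after every record.
with open(output_path, "w", newline="") as output_file:
    output_writer = csv.writer(output_file)
    output_writer.writerows(rows)

# Read the raw bytes back to check how each row is terminated.
with open(output_path, "rb") as check_file:
    raw_bytes = check_file.read()
```

With newline="" the file contains the same bytes on every platform, so the same script should behave identically on Windows, macOS, and Linux.
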
chapter 7
https://learning.oreilly.com/library/view/practical-python-data/9781492091493/ch07.html#:-:text=from%207%20a.m.%20to%209%20a.m.%20 https://learning.oreilly.com/library/view/practical-python-data/9781492091493/ch07.html#:-:text=to%20see%20if%20it%20matches

The two statements contradict each other: you want to filter for rides between 7:00 AM and 9:00 AM, but you implement a filter for 7:00 AM to 9:59 AM.

Tobias Schreiner  Oct 26, 2022 
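
Note on the entry above: the difference between the two filters can be sketched as follows. This is an illustration only, not the book's code; the function names are hypothetical, and the timestamp layout is assumed from the value quoted in the first entry of this list.

```python
from datetime import datetime

# Assumed timestamp layout, based on the value quoted in the first entry.
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def hour_between_7_and_9_inclusive(timestamp_text):
    """Accept hours 7, 8, and 9, so rides up to 9:59 a.m. pass."""
    hour = datetime.strptime(timestamp_text, TIMESTAMP_FORMAT).hour
    return 7 <= hour <= 9


def hour_from_7_until_9(timestamp_text):
    """Accept hours 7 and 8 only, so rides from 7:00 to 8:59 a.m. pass."""
    hour = datetime.strptime(timestamp_text, TIMESTAMP_FORMAT).hour
    return 7 <= hour < 9


# A hypothetical ride that starts at 9:45 a.m.
late_ride = "2020-09-01 09:45:00.0000"
```

The inclusive comparison accepts late_ride, while the half-open comparison rejects it. Which one is right depends on whether "7 a.m. to 9 a.m." is meant to include the 9 o'clock hour.
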
chapter 7
https://learning.oreilly.com/library/view/practical-python-data/9781492091493/ch07.html#:-:text=source_workbook_metadata.close()

The metadata file needs to be closed after the loop.

Tobias Schreiner  Oct 26, 2022 
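
Note on the entry above: one reading of this report is that the file should be opened once before the loop and closed once after it. The sketch below shows that pattern in general terms. The sheet names, metadata lines, and variable names are hypothetical and do not reproduce the book's example.

```python
import os
import tempfile

# Hypothetical sheet names and metadata lines, for illustration only.
sheets = {
    "sheet_one": ["source: example", "updated: unknown"],
    "sheet_two": ["source: example"],
}

metadata_path = os.path.join(tempfile.mkdtemp(), "metadata.txt")

# Open the output file once, before the loop starts.
metadata_file = open(metadata_path, "w")

for sheet_name, metadata_lines in sheets.items():
    for line in metadata_lines:
        metadata_file.write(sheet_name + ": " + line + "\n")
    # Closing the file at this indentation level would make the next
    # iteration's write() raise "ValueError: I/O operation on closed file".

# Close the file once, after every sheet has been processed.
metadata_file.close()

# Read the file back to confirm that all lines were written.
with open(metadata_path) as check_file:
    written_lines = check_file.read().splitlines()
```

A with open(...) block wrapped around the loop would achieve the same result and close the file automatically.
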
Other Digital Version Page 198
Last paragraph

7,207 is a little less than 1% of 790,620, not 0.1%.

Anonymous  Oct 22, 2023 
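
Note on the entry above: the arithmetic can be checked directly. The variable names below are placeholders; only the two numbers come from the report.

```python
# The two figures quoted in the report.
part = 7207
whole = 790620

# Express the part as a percentage of the whole.
share_percent = part / whole * 100

print(f"{share_percent:.2f}%")  # prints 0.91%
```
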
ePub Page 6
https://learning.oreilly.com/library/view/practical-python-data/9781492091493/ch06.html#:-:text=0.1%25

1 %, not 0.1 %

Tobias Schreiner  Oct 03, 2022 
Printed Page 8
4th paragraph

SQL stands for Structured Query Language.

Joachim D.  Oct 19, 2022 
Other Digital Version 38
Installing Atom

The Atom editor is discontinued as of December 2022. A replacement should be recommended.

Anonymous  Sep 28, 2023 
Printed Page 196
Example 6-4

There seems to be a copy-paste error at the beginning of Example 6-4: ppp_data_join isn't a "quick script for creating new CSVs that each contain the first few rows of our larger data files"; compare Example 6-3 just before it.

Andreas S.  Oct 12, 2022 
Other Digital Version 212
Paragraph before last in Python code snippet

The condition ppp_data['MORTGAGE_INTEREST_PROCEED'].isna() occurs twice; one of the occurrences should be ppp_data['PAYROLL_PROCEED'].notna() instead.

Anonymous  Oct 22, 2023