February 2018
Intermediate to advanced
378 pages
10h 14m
English
Dealing with data is hard; that's why we call it data science and data mining! Many different things can go wrong at different stages. Ben mentions data insufficiency, data leakage, non-stationary distributions, poor data sampling and splitting, data quality, and poorly anonymized data. Let's add a few more.
Read now
Unlock full access