May 2023
Intermediate to advanced
352 pages
11h 19m
English
The world of data is a wild and dangerous place for a data scientist. We must contend with different types of data, such as counts, categories, and strings, strewn with missing values and noise. We are asked to build predictive models for different types of tasks: binary classification, multiclass classification, regression, and ranking.
We have to build our machine-learning pipelines and preprocess our data with care to avoid data leakage. They have to be accurate, fast, robust, and meme-worthy (ok, that last one is probably optional). After all this, we end up with models that may well do the job they were trained for but are ultimately black boxes that no one understands. ...
Read now
Unlock full access