May 2016
Intermediate to advanced
12 pages
56m
English
| Topic | Common Challenges | Suggested Best Practice |
|---|---|---|
|
Data Preparation |
||
|
Data collection |
|
|
|
“Untidy” data |
|
Restructure the data to be “tidy” by using the melt and cast process |
|
Outliers |
|
|
|
Sparse target variables |
|
|
|
Variables of disparate magnitudes |
|
Standardization |
|
High-cardinality variables |
|
|
|
Missing data |
|
|
|
Strong multicollinearity |
Unstable parameter estimates ... | |