April 2018
Beginner to intermediate
282 pages
6h 52m
English
Categorical data in nature is non-parametric. This means that it doesn't follow any data distributions. However, for using those variables in a parametric model they need to be transformed using various encoding methods, missing values are to be replaced, and we can reduce the number of categories using binning techniques.