February 2019
Beginner to intermediate
308 pages
7h 42m
English
In machine learning projects, it is common to receive datasets with categorical variables. Here are some examples of categorical variables in datasets:
Machine learning algorithms such as neural networks are unable to work with such categorical variables as they expect numerical variables. Therefore, we need to perform preprocessing on these variables before feeding them into a machine learning algorithm.
One common way to convert these categorical variables into numerical variables is a technique known as one-hot encoding, implemented by the get_dummies() function in pandas. One-hot encoding ...