November 2019
Intermediate to advanced
304 pages
8h 40m
English
In step 1, we added all the transformations that are needed for the dataset. TransformProcess defines an unordered list of all the transformations that we want to apply to the dataset. We removed any unnecessary features by calling removeColumns(). During schema creation, we marked the categorical features in the Schema. Now, we can actually decide on what kind of transformation is required for a particular categorical variable. Categorical variables can be converted into integers by calling categoricalToInteger(). Categorical variables can undergo one-hot encoding if we call categoricalToOneHot(). Note that the schema needs to be created prior to the transformation process. We need the schema to create a TransformProcess ...