July 2018
Intermediate to advanced
474 pages
13h 37m
English
This section requires importing StringIndexer from pyspark.ml.feature. Before selecting features, we need to map string columns to columns of indices. StringIndexer assigns a distinct numeric index to each category of a categorical variable, so that the machine learning model can ingest the independent variables as numbers when predicting the target outcome.