July 2018
Intermediate to advanced
474 pages
13h 37m
English
While we did use the user-defined function, udf, to manually create a numerical label column, we also could have used a built-in feature from PySpark called StringIndexer to assign numerical values to categorical labels. To see StringIndexer in action, visit Chapter 5, Predicting Fire Department Calls with Spark ML.
Read now
Unlock full access