July 2018
Intermediate to advanced
334 pages
8h 20m
English
Let's create the appropriate imports first, as follows:
import org.apache.spark.ml.classification.RandomForestClassifierimport org.apache.spark.ml.evaluation.MulticlassClassificationEvaluatorimport org.apache.spark.ml.param._import org.apache.spark.ml.tuning.{ParamGridBuilder, TrainValidationSplit}import org.apache.spark.ml.{Pipeline, PipelineStage}
Let's start building a pipeline now. This is a pipeline that has three stages, which are StringIndexer, LabelIndexer, and randomForestClassifier.
Read now
Unlock full access