October 2013
Beginner to intermediate
382 pages
9h 55m
English
Filling missing values with the mean or median is a common approach to removing missing values. Modeler has mechanisms for computing and filling missing values using either the Set Globals node or the Data Audit node. Unfortunately, both of these are terminal nodes and therefore require the user to run them as a separate step or as a script. Moreover, the options for which values to impute with are limited to the mean, mid-point, or (in the case of the Data Audit node) a constant.
In this recipe we will impute missing values with the median of a variable in-stream, without the use of @GLOBAL variables.
This recipe uses the following files:
cup98lrn_reduced_vars3.savRecipe - impute ...Read now
Unlock full access