O'Reilly logo

Practical Predictive Analytics by Ralph Winters

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Example – setting the outliers to NA

The next step is of course to start examining these outliers. For our example, the extremes are part of the random number generation process, therefore they are not really outliers. However, if you encountered this situation in your own data, you would begin to track down the reasons these extremes occur. Start by trying to associate these extremes with other data elements. Perhaps these outliers are appearing in certain age groups and not in others.

For our example, we will simply be setting the value to NA for these extreme values. We will also be creating a new variable, v1x, to house the new variable, and will not overwrite the value of the original variable. As you investigate new ways of detecting ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required