IBM SPSS Modeler Cookbook by Scott Mutchler, Tom Khabaza, Meta S. Brown, Dean Abbott, Keith McCormick

Using a full data model/partial data model approach to address missing data

It is common in data mining for one category of customers to be more prone to missing data than others. In fact, some categories of customers are guaranteed to have certain data missing. For instance, suppose that in running your cell phone business you have found that the time elapsed between phone upgrades is useful for estimating when a customer's next upgrade will occur. A newly acquired customer has no prior phone history in the data set, so this value cannot be calculated for them, and it would be risky to assume that your new customers behave the same as your established customers.
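To make the pattern concrete, here is a minimal sketch in plain Python (not SPSS Modeler itself) of how such a field ends up missing for an entire category of customers. The customer IDs, purchase dates, and function names are invented for illustration. The final split into records with and without the derived field is only an assumption about what a full data/partial data separation might look like, not the recipe's actual stream.

```python
from datetime import date
from typing import Dict, List, Optional

# Hypothetical purchase history: customer id -> dates of phone purchases.
purchases: Dict[str, List[date]] = {
    "established_1": [date(2010, 1, 15), date(2011, 7, 15), date(2013, 1, 15)],
    "established_2": [date(2011, 3, 1), date(2013, 3, 1)],
    "new_1": [date(2013, 6, 1)],  # newly acquired: only one phone on record
}


def months_between(earlier: date, later: date) -> int:
    """Whole calendar months from earlier to later (day of month ignored)."""
    return (later.year - earlier.year) * 12 + (later.month - earlier.month)


def avg_months_between_upgrades(dates: List[date]) -> Optional[float]:
    """Average gap in months between consecutive purchases.

    Returns None when there are fewer than two purchases, because no gap
    exists to measure: the value is structurally missing, not merely unrecorded.
    """
    ordered = sorted(dates)
    gaps = [months_between(a, b) for a, b in zip(ordered, ordered[1:])]
    return sum(gaps) / len(gaps) if gaps else None


avg_gap = {cust: avg_months_between_upgrades(d) for cust, d in purchases.items()}

# Assumed split: customers for whom the field exists versus those without it.
full_data = {cust: gap for cust, gap in avg_gap.items() if gap is not None}
partial_data = [cust for cust, gap in avg_gap.items() if gap is None]

print(full_data)
print(partial_data)
```

Every customer with a single purchase lands in `partial_data`, which shows that the missingness is tied to customer tenure rather than occurring at random.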

How then to estimate the value of average months between new phones? One approach is to simply ...
