82 Mining Your Own Business in Telecoms Using DB2 Intelligent Miner for Data
You should define the following three items to decide which time frame of
customer data and churn information are going to be used in the model.
Data window: Time frame for input variables that is used for constructing
model
Forecasting window: Time frame for the prediction and used when sourcing
the target prediction variable (churn indicator). The churn prediction model is
often referred to as “WHO and WHEN” model which means that it tries to
answer the questions: who is going to leave the company and when are they
going to leave? The forecasting window is the “WHEN” part of churn
prediction modeling. In the phase of building model, the forecasting window is
the time frame to examine whether the customers left the company or not.
Time lag: Interval between data window and forecasting window.
In this case, we used six months as a data window, two months as a time lag and
one month as a forecasting window, as shown in Figure 5-1.
In the model building phase, six months of historical data from February to July
for customers who are active as of the end of July is used with churn information,
whether or not these customers left the company in October. This model can be
applied to customers who are active as of the end of August to predict probable
churners in November.
Therefore, in early September, marketing personnel can get the customers list of
those who are likely to leave the company in November, and a two month time
frame is available for them to setup and execute the proper marketing actions.
You can decide about a data window after studying historical churn patterns. You
better avoid certain time frames, if there are some abnormal patterns due to
external impacts. Timeframes of the latest data available to build the prediction
model is a good example of the data window.