58 Mining Your Own Business in Telecoms Using DB2 Intelligent Miner for Data
These two techniques solve the same problems from different views, both can be
used to complement each other and to gain confidence that the segmentation
produced has a valid and optimal result.
4.5.2 Applying the mining techniques
Behavior data is used as active variables to construct the segmentation model
and demographic and additional data are used as only supplementary variables
to describe the segments, which can be identified in segmentation graphical
output by [] brackets around the variable name as shown in Figure 4-2. We
describe some considerations when you apply the clustering technique.
Neural clustering
As we explained earlier, the neural clustering does not require us to specify
things like similarity, but only the number of clusters we want to derive. Neural
clustering requires us to specify the maximum number of clusters. In this
particular case, the number of segments chosen are nine. The reason for
choosing nine is: the objective of behavioral segmentation is to identify the
customer characteristics and it is used as a fundamental of Customer
Relationship Management by the marketing department. Typically, dealing with
more than nine customer segments for the marketing personnel is not easy and
not efficient in terms of deploying the result.
You can specify the maximum number of segments you want to generate.
However, the decision depends on where and how the segmentation result will
be used and should be considered from a marketing perspective. For example, in
fraud detection application, we need many more segments to find abnormal
behaviors in very small segments, which could possibly exist in huge call
transaction data sets. Fraud detection in the telecoms industry will be discussed
in detail in Chapter 7, “Can you determine the characteristics of known and
unknown fraudulent behavior?” on page 131.
Note: Both algorithms generate the score (ranges from 0 to 1) which
represents how well fit each record is in the assigned cluster, in other words,
cluster membership. In demographic clustering, the record is better fitting in
the cluster if the score is closer to 1, whereas in neural clustering it is better
fitting in the cluster if the score is closer to 0.