Chapter 4. How to discover the characteristics of your customers 61
If we give the demographic clustering algorithm a similarity threshold and do not
limit the number of clusters that it can produce, it will keep trying to find the
minimum number of clusters that satisfy the similarity threshold, because this
also maximizes the Condorcet value.
4.6 Interpreting the results
In the previous section we looked at the steps we have to follow to get our mining
results using two different clustering techniques. The
sixth stage in our generic
mining method
is to interpret the results that we have obtained and determine
how we can map them onto our business. When you are first confronted with the
cluster results the first question that you are going to ask is “What does it all
mean?”. In this section we describe how to understand and read the visualized
segmentation output and how to interpret the results from the neural clustering
technique that has generated nine different segments of customers.
4.6.1 How to read and evaluate the results
The cluster techniques that we have used both produce results that can be
displayed graphically. Although the visualized results are important in giving us
an overall impression of what is happening, evaluating the result with the
business point of view should not be neglected.
How to read the result?
The graphic output of neural clustering is shown in Figure 4-2. Each segment (or
cluster) is interpreted with its own behaviors and demographic and additional
variables.
Note: Condorcet like similarity has value in the ranges zero to one and is a
measure of how similar a customer is to other customers within the cluster and
how dissimilar they are to customers in other clusters. It has a value of 1 when
all customers in a cluster are identical and where there are no customers
outside the cluster that have the same characteristic. A value of zero indicates
that the customers are distributed randomly among the clusters. The
condorcet value can be calculated for all the customer variables or for each
variable separately.