September 2015
Beginner to intermediate
608 pages
13h 43m
English
At the bottom of the file we looked at in the previous section, you'll see some statistics that suggest how well the data has been clustered:
Inter-Cluster Density: 0.6135607681542804 Intra-Cluster Density: 0.6957348405534836
These two numbers can be considered as the equivalent to the variance within and the variance between measures we have seen in Chapter 2, Inference and Chapter 3, Correlation. Ideally, we are seeking a lower variance (or a higher density) within clusters compared to the density between clusters.
Inter-cluster density is the average distance between cluster centroids. Good clusters probably don't have centers that are too close to each other. If they did, it would indicate the ...
Read now
Unlock full access