O'Reilly logo

Clojure for Data Science by Henry Garner

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Cluster evaluation measures

At the bottom of the file we looked at in the previous section, you'll see some statistics that suggest how well the data has been clustered:

Inter-Cluster Density: 0.6135607681542804
Intra-Cluster Density: 0.6957348405534836

These two numbers can be considered as the equivalent to the variance within and the variance between measures we have seen in Chapter 2, Inference and Chapter 3, Correlation. Ideally, we are seeking a lower variance (or a higher density) within clusters compared to the density between clusters.

Inter-cluster density

Inter-cluster density is the average distance between cluster centroids. Good clusters probably don't have centers that are too close to each other. If they did, it would indicate the ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required