Hierarchical clustering

Hierarchical clustering is an unsupervised learning technique where a hierarchy of clusters is built out of observations.

This clustering groups data at various levels of a cluster tree or dendrogram. It is not a single set of clusters, but a hierarchy of multiple levels where clusters at a particular level are joined as clusters on the next level. This allows you to decide the level of clustering that is most suitable.

The hierarchical clusters essentially are of two types:

  • Agglomerative hierarchical clustering: This is a bottom-up method where each observation starts in its own cluster and two other clusters as they go up a hierarchy
  • Divisive hierarchical clustering: This is a top-down approach where observations start off ...

Get Mastering Python for Data Science now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.