Model-based clustering
Clustering is part of the unsupervised family of statistical/machine learning tasks and is similar to classification, but a little bit more difficult since we do not know the correct labels!
If we do not know the correct labels we can try grouping data points together. Loosely speaking, points that are closer between themselves, under some metric, are defined as belonging to the same group and separated from the other groups. Clustering has many, many applications; for example, phylogenetics, a branch of biology studying the evolutionary relationships among biological entities, can be framed as clustering techniques applied to and guided by an evolutionary question. A more capitalist-driven application of clustering is determining ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access