4
Clustering
Clustering is an unsupervised machine learning method that’s used for splitting the original dataset of objects into groups classified by properties. In machine learning, an object is typically represented as a point in a multidimensional metric space. Every space dimension corresponds to an object property (feature), and the metric is a function of the values of these properties. Depending on the types of dimensions in this space, which can be both numerical and categorical, we choose a type of clustering algorithm and specific metric function. This choice depends on the nature of different object properties’ types.
At the present stage, clustering is often used as the first step in data analysis. The task of clustering was formulated ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access