Chapter 6. K-means clustering
Humanity has never had more data about more facets of society than it does today. Computers are great for storing data sets, but drawing meaning from those data sets still requires human analysis. Clustering is a computational technique that divides the points in a data set into groups. A successful clustering produces groups whose points are related to one another; whether those relationships are meaningful generally requires human verification.
In clustering, the group (a.k.a. cluster) that a data point belongs to is not predetermined, but instead is decided during the run of the clustering algorithm. In fact, the algorithm is not guided to place any particular data point in any particular cluster by presupposed ...
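To make the idea of membership being decided during the run concrete, here is a minimal sketch of k-means, the algorithm named in the chapter title. It is not the chapter's own implementation, which is not visible in this excerpt. The function name `kmeans`, the choice to seed centroids with the first k points, and the sample points are all assumptions made for illustration.

```python
from math import dist
from statistics import mean


def kmeans(points, k, max_iterations=100):
    """Group points into k clusters; returns (assignments, centroids)."""
    # Assumption: seed the centroids with the first k points. This keeps the
    # sketch deterministic; real implementations often seed randomly.
    centroids = [tuple(p) for p in points[:k]]
    assignments = [None] * len(points)

    for _ in range(max_iterations):
        # Assign each point to the index of its nearest centroid.
        new_assignments = [
            min(range(k), key=lambda c: dist(p, centroids[c])) for p in points
        ]
        # Stop once no point changes cluster.
        if new_assignments == assignments:
            break
        assignments = new_assignments

        # Move each centroid to the mean of the points assigned to it.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # leave an empty cluster's centroid where it is
                centroids[c] = tuple(mean(dim) for dim in zip(*members))

    return assignments, centroids


# Hypothetical sample data: two loose groups of 2-D points.
points = [(1.0, 1.0), (9.0, 9.0), (1.5, 2.0), (8.0, 8.5), (1.0, 0.5), (9.5, 8.0)]
assignments, centroids = kmeans(points, k=2)
print(assignments)  # [0, 1, 0, 1, 0, 1]
```

Nothing in the input says which point belongs to which cluster. The labels come only from repeatedly assigning points to the nearest centroid and recomputing the centroids.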