K-means clustering

The K-means algorithm is also referred to as vector quantization. What the algorithm does is finds the cluster (centroid) positions that minimize the distances to all points in the cluster. This is done iteratively; the problem with the algorithm is that it can be a bit greedy, meaning that it will find the nearest minima quickly. This is generally solved with some kind of basin-hopping approach where the nearest minima found is randomly perturbed and the algorithm restarted. Due to this fact, the algorithm is dependent on good initial guesses as input.

Suicide rate versus GDP versus absolute latitude

As mentioned in Chapter 4 , Regression, we will analyze the data of suicide rates versus GDP versus absolute latitude or Degrees ...

Get Mastering Python Data Analysis now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Mastering Python Data Analysis by Magnus Vilhelm Persson, Luiz Felipe Martins

K-means clustering

Suicide rate versus GDP versus absolute latitude

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly