Chapter 10. Grouping unlabeled items using k-means clustering


This chapter covers
  • The k-means clustering algorithm
  • Cluster postprocessing
  • Bisecting k-means
  • Clustering geographic points


The 2000 and 2004 presidential elections in the United States were close—very close. The largest percentage of the popular vote that any candidate received was 50.7% and the lowest was 47.9%. If a percentage of the voters were to have switched sides, the outcome of the elections would have been different. There are small groups of voters who, when properly appealed to, will switch sides. These groups may not be huge, but with such close races, they may be big enough to change the outcome of the election.[1] How do you find these groups ...

Get Machine Learning in Action now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.