Partitioning using k-means clustering

The goal of partitioning is to place partitions and create clusters that reduce the within cluster sum of square error. In an extreme case, you could achieve a zero sum of square error if every data point existed in its own cluster. This would not be very useful though, would it? So partitioning is about finding the balance between reducing error and finding the right number of clusters.

A commonly used partitioning method is k-means. You will more often see it referred to as k-means clustering. K-means clustering places centers at k locations in the observation space to serve as the means of these k clusters. For example, if you were performing k-means clustering with k = 3, you would place three cluster means ...

Get Introduction to R for Business Intelligence now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.