Introduction to Clustering Methods
Clustering is a multivariate technique for grouping rows that share similar values. It can use any number of variables, but they must be numeric variables for which numerical differences make sense. Typically, data are not scattered evenly through n-dimensional space; instead they form clumps, locally dense areas, modes, or clusters. Identifying these clusters goes a long way toward characterizing the distribution of values.
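To make "numerical differences make sense" concrete, here is a minimal sketch that measures how similar two rows are by the distance between their values. Euclidean distance is assumed here purely for illustration; the function name and the sample rows are hypothetical and are not taken from JMP.

```python
import math

def euclidean_distance(row_a, row_b):
    """Straight-line distance between two rows of numeric values."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(row_a, row_b)))

# Three hypothetical rows, each with two numeric variables.
rows = [(1.0, 2.0), (1.5, 1.8), (8.0, 9.0)]

d_near = euclidean_distance(rows[0], rows[1])  # rows in the same clump
d_far = euclidean_distance(rows[0], rows[2])   # rows in different clumps
```

Rows that belong to the same clump have a small distance between them, and rows from different clumps have a large one. A variable whose differences carry no meaning, such as an arbitrary ID code, would distort these distances.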
JMP provides two approaches to clustering:
• hierarchical clustering for small tables, up to several thousand rows
• k-means and normal mixtures clustering for large tables, up to hundreds of thousands of rows
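As a rough illustration of the k-means idea named in the list above, the following sketch alternates between assigning each row to its nearest center and moving each center to the mean of its rows. It is a generic textbook version written under simplifying assumptions (squared Euclidean distance, caller-supplied starting centers, hypothetical function names and data). It is not JMP's implementation.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, initial_centers, max_iter=100):
    """Basic k-means: returns (labels, centers). Requires max_iter >= 1."""
    centers = [tuple(c) for c in initial_centers]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest center.
        labels = [
            min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))
            for p in points
        ]
        # Update step: each center moves to the mean of its members.
        new_centers = []
        for j, old in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                new_centers.append(old)  # keep an empty cluster's center in place
        if new_centers == centers:
            break  # centers stopped moving, so assignments are stable
        centers = new_centers
    return labels, centers

# Two hypothetical clumps of rows in two dimensions.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1),
          (8.0, 8.0), (8.2, 7.9), (7.9, 8.3)]
labels, centers = kmeans(points, initial_centers=[points[0], points[3]])
```

Each pass touches every row once per center, so the work grows roughly in proportion to the number of rows. Hierarchical methods generally compare pairs of rows, and the number of pairs grows with the square of the row count.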
Hierarchical clustering is also ...