Performing cluster analysis using K-means clustering
The standard R package stats
provides the function for K-means clustering. We also use the cluster
package to plot the results of our cluster analysis.
Getting ready
If you have not already downloaded the files for this chapter, do it now and ensure that the auto-mpg.csv
file is in your R working directory. Also, ensure that you have installed the cluster
package.
How to do it...
To perform cluster analysis using K-means clustering, follow theses steps:
- Read the data:
> auto <- read.csv("auto-mpg.csv")
- Define a convenience function to standardize the relevant variables and append the resulting variables to the original data:
rdacb.scale.many <- function (dat, column_nos) { nms <- names(dat) for (col ...
Get R: Recipes for Analysis, Visualization and Machine Learning now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.