O'Reilly logo

Bioinformatics with R Cookbook by Paurush Praveen Sinha

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Data clustering in R using k-means and hierarchical clustering

Clustering is an unsupervised learning task in data mining. It aims to group a set of data points in such a way that those in the same group are more similar to each other than to those in other groups. These groups are unlabeled and are called clusters. The goal of clustering is to minimize the distances between the data points within the cluster and maximize the distances between the clusters. There are several functions available in R to perform different kinds of clustering. This recipe will explain some of these functions.

Getting ready

To perform clustering, we need our dataset to be clustered. We also need to decide the number of groups that we intend to organize our clusters ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required