O'Reilly logo

Spark Cookbook by Rishi Yadav

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Clustering using k-means

Cluster analysis or clustering is the process of grouping data into multiple groups so that the data in one group is similar to the data in other groups.

The following are a few examples where clustering is used:

  • Market segmentation: Dividing the target market into multiple segments so that the needs of each segment can be served better
  • Social network analysis: Finding a coherent group of people in the social network for ad targeting through a social networking site such as Facebook
  • Data center computing clusters: Putting a set of computers together to improve performance
  • Astronomical data analysis: Understanding astronomical data and events such as galaxy formations
  • Real estate: Identifying neighborhoods based on similar features ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required