O'Reilly logo

Data Clustering by Chandan K. Reddy, Charu C. Aggarwal

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 9

Clustering High-Dimensional Data

Arthur Zimek

University of Alberta Edmonton, Canadazimek@ualberta.ca

9.1 Introduction

The general definition of the task of clustering as to find a set of groups of similar objects within a data set while keeping dissimilar objects separated in different groups or the group of noise is very common. Although Estivill-Castro criticizes this definition for including a grouping criterion [47], this criterion (similarity) is exactly what is in question among many different approaches. Especially in high-dimensional data, the meaning and definition of similarity is right at the heart of the problem. In many cases, the similarity of objects is assessed within subspaces, e.g., using a subset of the dimensions ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required