Chapter 3

Rough-Fuzzy Clustering: Generalized c-Means Algorithm

3.1 Introduction

Cluster analysis is a technique for finding natural groups present in the data. It divides a given data set into a set of clusters in such a way that two objects from the same cluster are as similar as possible and the objects from different clusters are as dissimilar as possible. In effect, it tries to mimic the human ability to group similar objects into classes and categories [1].

Clustering techniques have been effectively applied to a wide range of engineering and scientific disciplines such as pattern recognition, machine learning, psychology, biology, medicine, computer vision, web intelligence, communications, and remote sensing. A number of clustering algorithms have been proposed to suit different requirements [1, 2]. The hard clustering algorithms generate crisp clusters by assigning each object to exactly one cluster. When the clusters are not well defined, that is, when they are overlapping, one may desire fuzzy clusters. In this regard, the problem of pattern classification is formulated as the problem of interpolation of the membership function of a fuzzy set in Bellman et al. [3], and thereby a link with the basic problem of system identification is established. A seminal contribution to cluster analysis is Ruspini's concept of a fuzzy partition [4]. The application of fuzzy set theory to cluster analysis was initiated by Dunn and Bezdek by developing fuzzy ISODATA [5] and fuzzy

Get Rough-Fuzzy Pattern Recognition: Applications in Bioinformatics and Medical Imaging now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.