Clustering of Mixed Data by Integrating Fuzzy, Probabilistic, and Collaborative Clustering Framework
Article Type
Research Article
Publication Title
International Journal of Fuzzy Systems
Abstract
Clustering of numerical data is a well-researched problem, and so is clustering of categorical data. However, the literature on clustering data with mixed attributes is far less extensive. For numerical data, fuzzy clustering, in particular the fuzzy c-means (FCM) algorithm, is effective and popular, while for categorical data, mixture models are widely used. In this paper, we propose a novel framework for clustering mixed data, which contains both numerical and categorical attributes. Our objective is to find the cluster substructures that are common to both the categorical and numerical data. Our formulation is inspired by the FCM algorithm (for dealing with numerical data), mixture models (for dealing with categorical data), and the collaborative clustering framework for aggregating the two; it is an integrated approach that judiciously uses all three components. We apply our algorithm to a few commonly used datasets and compare our results with those of some state-of-the-art methods.
First Page
339
Last Page
348
DOI
10.1007/s40815-016-0168-y
Publication Date
6-1-2016
Recommended Citation
Pathak, Arkanath and Pal, Nikhil R., "Clustering of Mixed Data by Integrating Fuzzy, Probabilistic, and Collaborative Clustering Framework" (2016). Journal Articles. 4145.
https://digitalcommons.isical.ac.in/journal-articles/4145