Weighted Fuzzy C-Means: Unsupervised Feature Selection to Realize a Target Partition
Article Type
Research Article
Publication Title
International Journal of Uncertainty Fuzziness and Knowledge Based Systems
Abstract
We introduce an unsupervised feature selection method based on regularized weighted Fuzzy C-Means (WRFCM) clustering. When the target task is clustering, our objective should be to select a subset of features that can generate the same/similar partition matrix to the partition matrix obtained from the original high dimensional data by a clustering algorithm. To achieve this we propose a novel objective function keeping in view the Fuzzy-C-Means (FCM) clustering algorithm. This approach realizes feature selection within the WRFCM framework, emphasizing features to maintain the FCM-based target partition. We evaluate our method using Normalized Mutual Information (NMI), Adjusted Rand Index (ARI) and Kuhn-Munkres index (KM-index). NMI, and ARI measure the agreement between clusters, i.e, the partition in the lower dimension and the partition of the original data. On the other hand, KM-index measures the disagreement between the two partitions. Experimental results on synthetic and real datasets showcase our method's efficacy in selecting informative features. This approach fills a crucial gap in unsupervised feature selection, making it valuable for real-world applications. The approach is very general in the sense that the target partition can be generated by any clustering algorithm or even by the actual class labels of the data, when they are available.
First Page
1111
Last Page
1134
DOI
10.1142/S0218488524500260
Publication Date
11-1-2024
Recommended Citation
Sarkar, Kaushik; Mudi, Rajani K.; and Pal, Nikhil R., "Weighted Fuzzy C-Means: Unsupervised Feature Selection to Realize a Target Partition" (2024). Journal Articles. 5190.
https://digitalcommons.isical.ac.in/journal-articles/5190