Weighted Fuzzy C-Means: Unsupervised Feature Selection to Realize a Target Partition

Article Type

Research Article

Publication Title

International Journal of Uncertainty Fuzziness and Knowledge Based Systems

Abstract

We introduce an unsupervised feature selection method based on regularized weighted Fuzzy C-Means (WRFCM) clustering. When the target task is clustering, our objective should be to select a subset of features that can generate the same/similar partition matrix to the partition matrix obtained from the original high dimensional data by a clustering algorithm. To achieve this we propose a novel objective function keeping in view the Fuzzy-C-Means (FCM) clustering algorithm. This approach realizes feature selection within the WRFCM framework, emphasizing features to maintain the FCM-based target partition. We evaluate our method using Normalized Mutual Information (NMI), Adjusted Rand Index (ARI) and Kuhn-Munkres index (KM-index). NMI, and ARI measure the agreement between clusters, i.e, the partition in the lower dimension and the partition of the original data. On the other hand, KM-index measures the disagreement between the two partitions. Experimental results on synthetic and real datasets showcase our method's efficacy in selecting informative features. This approach fills a crucial gap in unsupervised feature selection, making it valuable for real-world applications. The approach is very general in the sense that the target partition can be generated by any clustering algorithm or even by the actual class labels of the data, when they are available.

First Page

1111

Last Page

1134

DOI

10.1142/S0218488524500260

Publication Date

11-1-2024

Share

COinS