Journal Articles

Detecting Meaningful Clusters from High-dimensional Data: A Strongly Consistent Sparse Center-based Clustering Approach

Saptarshi Chakraborty, Indian Statistical Institute, Kolkata
Swagatam Das, Indian Statistical Institute, Kolkata

Article Type

Research Article

Publication Title

IEEE Transactions on Pattern Analysis and Machine Intelligence

Abstract

In this paper, we propose a Lasso Weighted k-means ($LW$-k-means) algorithm, as a simple yet efficient sparse clustering procedure for high-dimensional data where the number of features ($p$) can be much higher than the number of observations (n). The $LW$-k-means method imposes an $\ell_1$ regularization term involving the feature weights directly to induce feature selection in a sparse clustering framework. We develop a simple block-coordinate descent type algorithm with time-complexity resembling that of Lloyd's method, to optimize the proposed objective. In addition, we establish the strong consistency of the $LW$-k-means procedure. Such consistency proof is not available for the conventional spare k-means algorithms, in general. $LW$-k-means is tested on a number of synthetic and real-life datasets and through a detailed experimental analysis, we find that the performance of the method is highly competitive against the baselines as well as the state-of-the-art procedures for center-based high-dimensional clustering, not only in terms of clustering accuracy but also with respect to computational time

DOI

10.1109/TPAMI.2020.3047489

Publication Date

1-1-2020

Recommended Citation

Chakraborty, Saptarshi and Das, Swagatam, "Detecting Meaningful Clusters from High-dimensional Data: A Strongly Consistent Sparse Center-based Clustering Approach" (2020). Journal Articles. 454.
https://digitalcommons.isical.ac.in/journal-articles/454

Link to Full Text

COinS

Journal Articles

Detecting Meaningful Clusters from High-dimensional Data: A Strongly Consistent Sparse Center-based Clustering Approach

Article Type

Publication Title

Abstract

DOI

Publication Date

Recommended Citation

Browse

Search

Author Corner

Links

Journal Articles

Detecting Meaningful Clusters from High-dimensional Data: A Strongly Consistent Sparse Center-based Clustering Approach

Authors

Article Type

Publication Title

Abstract

DOI

Publication Date

Recommended Citation

Share

Browse

Search

Author Corner

Links