Journal Articles

Tight clustering for large datasets with an application to gene expression data

Bikram Karmakar, University of Pennsylvania
Sarmistha Das, Indian Statistical Institute, Kolkata
Sohom Bhattacharya, Indian Statistical Institute, Kolkata
Rohan Sarkar, Indian Statistical Institute, Kolkata
Indranil Mukhopadhyay, Indian Statistical Institute, Kolkata

Article Type

Research Article

Publication Title

Scientific Reports

Abstract

This article proposes a practical and scalable version of the tight clustering algorithm. The tight clustering algorithm produces tight, stable, and relevant clusters while leaving a set of noise or scattered points that are not assigned to any cluster. However, the computational cost of obtaining such tight clusters prohibits its use on large microarray gene expression data sets, or on any other large data sets, which are now common. We propose a pragmatic and scalable version of the tight clustering method that is applicable to very large data sets, and we deduce the properties of the proposed algorithm. We validate our algorithm with an extensive simulation study and multiple real data analyses, including an analysis of real gene expression data.

DOI

10.1038/s41598-019-39459-w

Publication Date

12-1-2019

Comments

Open Access, Gold, Green

Recommended Citation

Karmakar, Bikram; Das, Sarmistha; Bhattacharya, Sohom; Sarkar, Rohan; and Mukhopadhyay, Indranil, "Tight clustering for large datasets with an application to gene expression data" (2019). Journal Articles. 606.
https://digitalcommons.isical.ac.in/journal-articles/606

