Journal Articles

Two provably consistent divide-and-conquer clustering algorithms for large networks

Soumendu Sundar Mukherjee, Indian Statistical Institute, Kolkata
Purnamrita Sarkar, The University of Texas at Austin
Peter J. Bickel, University of California, Berkeley

Article Type

Research Article

Publication Title

Proceedings of the National Academy of Sciences of the United States of America

Abstract

In this article, we advance divide-and-conquer strategies for solving the community detection problem in networks. We propose two algorithms that perform clustering on several small subgraphs and finally patch the results into a single clustering. The main advantage of these algorithms is that they significantly bring down the computational cost of traditional algorithms, including spectral clustering, semidefinite programs, modularity-based methods, likelihood-based methods, etc., without losing accuracy, and even improving accuracy at times. These algorithms are also, by nature, parallelizable. Since most traditional algorithms are accurate, and the corresponding optimization problems are much simpler in small problems, our divide-and-conquer methods provide an omnibus recipe for scaling traditional algorithms up to large networks. We prove the consistency of these algorithms under various subgraph selection procedures and perform extensive simulations and real-data analysis to understand the advantages of the divide-and-conquer approach in various settings.
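The abstract does not spell out how the subgraph clusterings are patched together, so the following is only a minimal sketch of the general idea, not the paper's actual procedures. It assumes two communities, uniform random node subsampling, a sign-based spectral split as the base method, and patching by averaging how often pairs of nodes are clustered together. All function names and parameters (`two_way_spectral`, `divide_and_conquer_cluster`, `sample_two_block_graph`, `subgraph_size`, `num_subgraphs`) are hypothetical and chosen for illustration.

```python
import numpy as np


def two_way_spectral(A):
    """Base method (an assumption): split nodes into two groups by the sign of
    the eigenvector belonging to the second-largest eigenvalue of A."""
    _, vecs = np.linalg.eigh(A)  # eigenvalues are returned in ascending order
    return (vecs[:, -2] > 0).astype(int)


def divide_and_conquer_cluster(A, subgraph_size, num_subgraphs, rng):
    """Cluster many small random subgraphs, then patch the results together.

    Patching rule (an assumption): average, over the subgraphs containing both
    nodes, how often a pair lands in the same cluster, then split on the
    leading eigenvector of the centered average.
    """
    n = A.shape[0]
    same = np.zeros((n, n))  # times i and j were put in the same cluster
    seen = np.zeros((n, n))  # times i and j appeared in the same subgraph
    for _ in range(num_subgraphs):
        nodes = rng.choice(n, size=subgraph_size, replace=False)
        block = np.ix_(nodes, nodes)
        labels = two_way_spectral(A[block])
        # Pairwise agreement does not depend on how each subgraph names its
        # clusters, so no label matching across subgraphs is needed.
        same[block] += (labels[:, None] == labels[None, :]).astype(float)
        seen[block] += 1.0
    # Scores lie in [-1, 1]; pairs never sampled together contribute 0.
    score = np.where(seen > 0, 2.0 * same / np.maximum(seen, 1.0) - 1.0, 0.0)
    _, vecs = np.linalg.eigh(score)
    return (vecs[:, -1] > 0).astype(int)


def sample_two_block_graph(n, p, q, rng):
    """Toy test graph: two equal communities, edge probability p within a
    community and q between communities (n is assumed to be even)."""
    truth = np.repeat([0, 1], n // 2)
    probs = np.where(truth[:, None] == truth[None, :], p, q)
    upper = np.triu(rng.random((n, n)) < probs, k=1)
    A = (upper | upper.T).astype(float)
    return A, truth


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A, truth = sample_two_block_graph(200, 0.6, 0.05, rng)
    labels = divide_and_conquer_cluster(A, subgraph_size=80, num_subgraphs=50, rng=rng)
    # Cluster names are arbitrary, so score against both labelings.
    accuracy = max(np.mean(labels == truth), np.mean(labels != truth))
    print(f"fraction of nodes correctly grouped: {accuracy:.2f}")
```

Each iteration of the loop touches only one small subgraph and is independent of the others, which is what makes this kind of scheme parallelizable. The base method could be swapped for any of the traditional algorithms the abstract lists.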

DOI

10.1073/pnas.2100482118

Publication Date

11-2-2021

Comments

Open Access, Green

Recommended Citation

Mukherjee, Soumendu Sundar; Sarkar, Purnamrita; and Bickel, Peter J., "Two provably consistent divide-and-conquer clustering algorithms for large networks" (2021). Journal Articles. 1716.
https://digitalcommons.isical.ac.in/journal-articles/1716

This document is currently not available here.
