Journal Articles

Boosting with lexicographic programming: Addressing class imbalance without cost tuning

Shounak Datta, Indian Statistical Institute, Kolkata
Sayak Nag, Indian Institute of Technology Delhi
Swagatam Das, Indian Statistical Institute, Kolkata

Article Type

Research Article

Publication Title

IEEE Transactions on Knowledge and Data Engineering

Abstract

A large amount of research effort has been dedicated to adapting boosting for imbalanced classification. However, boosting methods are yet to be satisfactorily immune to class imbalance, especially for multi-class problems. This is because most of the existing solutions for handling class imbalance rely on expensive cost set tuning for determining the proper level of compensation. We show that the assignment of weights to the component classifiers of a boosted ensemble can be thought of as a game of Tug of War between the classes in the margin space. We then demonstrate how this insight can be used to attain a good compromise between the rare and abundant classes without having to resort to cost set tuning, which has long been the norm for imbalanced classification. The solution is based on a lexicographic linear programming framework which requires two stages. Initially, class-specific component weight combinations are found so as to minimize a hinge loss individually for each of the classes. Subsequently, the final component weights are assigned so that the maximum deviation from the class-specific minimum loss values (obtained in the previous stage) is minimized. Hence, the proposal is not only restricted to two-class situations, but is also readily applicable to multi-class problems. Additionally, we also derive the dual formulation corresponding to the proposed framework. Experiments conducted on artificial and real-world imbalanced datasets as well as on challenging applications such as hyperspectral image classification and ImageNet classification establish the efficacy of the proposal.

First Page

883

Last Page

897

DOI

10.1109/TKDE.2019.2894148

Publication Date

5-1-2020

Comments

Open Access, Green

Recommended Citation

Datta, Shounak; Nag, Sayak; and Das, Swagatam, "Boosting with lexicographic programming: Addressing class imbalance without cost tuning" (2020). Journal Articles. 310.
https://digitalcommons.isical.ac.in/journal-articles/310

Link to Full Text

COinS

Journal Articles

Boosting with lexicographic programming: Addressing class imbalance without cost tuning

Article Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Comments

Recommended Citation

Browse

Search

Author Corner

Links

Journal Articles

Boosting with lexicographic programming: Addressing class imbalance without cost tuning

Authors

Article Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Comments

Recommended Citation

Share

Browse

Search

Author Corner

Links