Handling Class Imbalance by Estimating Minority Class Statistics
Document Type
Conference Article
Publication Title
Proceedings of the International Joint Conference on Neural Networks
Abstract
Class imbalance arises in machine learning when data are distributed unequally across classes: most samples belong to one class, and only a few represent the others. One paradigm for tackling this issue is oversampling, which synthesizes artificial minority-class samples as convex combinations of existing minority-class samples, with each method selecting the samples to combine in its own specialized way. Existing methods do not take into account any information about the actual distribution of the minority class, which leads to inconsistencies between the generated distribution and the distribution that the minority class might actually have. In this paper, we propose a parametrization-based method that estimates the statistics of the minority class samples using the statistics of the nearby classes. Its hyperparameters control the generated distribution so that it can approximate the original distribution. Experiments on synthetic and real-world benchmark datasets demonstrate the usefulness of our techniques across multiple metrics.
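The convex-combination oversampling paradigm described in the abstract can be illustrated with a minimal sketch in the spirit of SMOTE-style interpolation. This is not the method proposed in the paper; it only shows the existing approach the abstract contrasts itself with. The function name `convex_oversample` and its parameters (`minority`, `n_new`, `k`, `seed`) are hypothetical and chosen for illustration.

```python
import random


def convex_oversample(minority, n_new, k=5, seed=0):
    """Generate n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbors (illustrative only)."""
    rng = random.Random(seed)
    n = len(minority)
    if n < 2:
        raise ValueError("need at least two minority samples to interpolate")
    k = min(k, n - 1)

    def sqdist(a, b):
        # Squared Euclidean distance between two feature vectors.
        return sum((x - y) ** 2 for x, y in zip(a, b))

    # For each minority sample, record the indices of its k nearest neighbors.
    neighbors = []
    for i, p in enumerate(minority):
        others = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: sqdist(p, minority[j]),
        )
        neighbors.append(others[:k])

    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(n)            # random base sample
        j = rng.choice(neighbors[i])    # random neighbor of that sample
        lam = rng.random()              # mixing weight in [0, 1)
        # Convex combination: (1 - lam) * x_i + lam * x_j
        synthetic.append(
            [(1 - lam) * a + lam * b for a, b in zip(minority[i], minority[j])]
        )
    return synthetic
```

Because every synthetic point lies on a segment between two observed minority samples, the generated points never leave the convex hull of the observed minority data. This is one way to see the limitation the abstract points out: the generated distribution is shaped by the few available samples rather than by any estimate of the minority class's underlying statistics.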
DOI
10.1109/IJCNN54540.2023.10191975
Publication Date
1-1-2023
Recommended Citation
Ansari, Faizanuddin; Das, Swagatam; and Shamsolmoali, Pourya, "Handling Class Imbalance by Estimating Minority Class Statistics" (2023). Conference Articles. 584.
https://digitalcommons.isical.ac.in/conf-articles/584