A novel adaptive k-NN classifier for handling imbalance: Application to brain MRI

Article Type

Research Article

Publication Title

Intelligent Data Analysis

Abstract

The problem of efficiently classifying imbalanced data has become one of the most challenging tasks in machine learning. Some real world examples include medical image analysis, fraud detection, fault diagnosis, and anomaly detection. Although several data-level algorithms have been developed to address imbalance, they are typically subject to some restrictions. We propose a novel variant of the k-NN family of classifiers, and name this as Density-based Adaptive-distance kNN (DAkNN). It can effectively handle data with skewed distributions and varying class-densities using the concept of adaptive distance. Comparative superiority is experimentally established over related data-level algorithms (SMOTE, ADASYN), using ten sets of two-class data, in terms of geometric mean (of the true positive and negative rates) and accuracy. Additionally, five sets of multi-class data are considered and compared with different variants of k-NN, which are currently very popular. Finally, DAkNN is successfully applied on the highly imbalanced Lower Grade Glioma (LGG) MR images, with an Average-Dice score of 0.9082 for delineating the tumor regions. The results demonstrate clear superiority over state-of-the-art algorithms.

First Page

909

Last Page

924

DOI

10.3233/IDA-194647

Publication Date

1-1-2020

Share

COinS