An intensity-based deep approach to mitigate step-imbalance problem under extreme paucity of images from rare classes

DOI: https://doi.org/10.1007/s11042-024-19303-8
IF: 2.577
2024-05-10
Multimedia Tools and Applications
Abstract:In this manuscript, a deep learning-based solution to the step-imbalance problem has been investigated for multi-class image annotation task where the number of training images from some of the rare classes is extremely low. Step-imbalance is a complex sub-problem of the popular class-imbalance problems where there is a steep (stair-like) disparity amongst the sample frequencies for the majority, medium and minority classes. Contrasting to the classical solutions to class- imbalance and long-tailed distribution problems, here there is a huge gap in the sample frequencies between majority and minority classes thus forming a staircase function in the overall class frequency distribution. Moreover, the pro- posed methodology focuses on a robust solution by operating under an extreme scarcity of labeled images from the minority classes (for example, below 20 images per class). Due to this, the existing neural solutions based on cost-sensitisation or generative oversampling are ineffective as they rely on the availability of sufficient minority examples in mitigating the effect of a relative and skewed 'class-imbalance ratio' measure. This situation is prevalent in the real-life appli- cations of computer vision, remote sensing and allied domains having severe scarcity of minority examples along with a wide gap between the major-minor class frequencies. To work under a scarce environment, an intensity-based split- ting technique has been explored to automatically extract synthetic samples for oversampling the images from the minority classes devoid of any training. In par- allel, a siamese network-based undersampling technique has been investigated for selective fusion of non-contrasting images from the majority classes. In overall, a 6–19% improvement over the existing approaches in terms of precision, recall and F1-score has been observed for the proposed technique while experimenting with CIFAR-10, Natural Images and ASCD datasets.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?