What makes multi-class imbalanced problems difficult? An experimental study

Mateusz Lango, Jerzy Stefanowski
DOI: https://doi.org/10.1016/j.eswa.2022.116962
IF: 8.5
2022-08-01
Expert Systems with Applications
Abstract: Multi-class imbalanced classification is more difficult and less frequently studied than its binary counterpart. Moreover, research on the causes of the difficulty of multi-class imbalanced data is quite limited. Therefore, we experimentally study the impact of various multi-class imbalance difficulty factors on the performance of three popular classifiers. The results demonstrated a strong influence of class overlapping, with the extent of its impact depending on the types of the overlapped classes. In particular, overlapping between minority and majority classes was more difficult than the other types of overlapping. The type of class size configuration turned out to be another important factor, highlighting the special role of configurations with classes of intermediate sizes. The obtained results could support studying the nature of multi-class imbalanced data as well as the development of new methods for improving classifiers.
computer science, artificial intelligence; engineering, electrical & electronic; operations research & management science