The Influence of Class Imbalance on Cost-Sensitive Learning: an Empirical Study.

Xu-Ying Liu,Zhi-Hua Zhou
DOI: https://doi.org/10.1109/icdm.2006.158
2006-01-01
Abstract:In real-world applications the number of examples in one class may overwhelm the other class, but the primary interest is usually on the minor class. Cost-sensitive learning has been deeded as a good solution to these class-imbalanced tasks, yet it is not clear how does the class-imbalance affect cost-sensitive classifiers. This paper presents an empirical study using 38 data sets, which discloses that class-imbalance often affects the performance of cost-sensitive classifiers: When the misclassification costs are not seriously unequal, cost-sensitive classifiers generally favor natural class distribution although it might be imbalanced; while when misclassification costs are seriously unequal, a balanced class distribution is more favorable.
What problem does this paper attempt to address?