Discussion of Classification for Imbalanced Data Sets

Wei Mei Zhi, Hua Ping Guo, Ming Fan
DOI: https://doi.org/10.4028/www.scientific.net/amr.546-547.622
2012-01-01
Advanced Materials Research
Abstract: Most classifiers lose efficiency when the class distribution is imbalanced, yet such imbalance is often statistically significant in practice. The problem of learning from imbalanced datasets has therefore attracted growing attention in recent years. This paper provides a comprehensive review of the classification of imbalanced datasets: the nature of the problem, the factors that affect it, the current assessment metrics used to evaluate learning performance, and the opportunities and challenges in learning from imbalanced data.