An Empirical Study of the Noise Impact on Cost-Sensitive Learning

Xingquan Zhu,Xindong Wu,Taghi M. Khoshgoftaar,Yong Shi
2007-01-01
Abstract:In this paper, we perform an empirical study of the impact of noise on cost-sensitive (CS) learning, through observations on how a CS learner reacts to the mislabeled training examples in terms of misclassification cost and classification accuracy. Our empirical results and theoretical analysis indicate that mislabeled training examples can raise serious concerns for cost-sensitive classification, especially when misclassifying some classes becomes extremely expensive. Compared to general inductive learning, the problem of noise handling and data cleansing is more crucial, and should be carefully investigated to ensure the success of CS learning.
What problem does this paper attempt to address?