Effective Feature Selection on Data with Uncertain Labels

Bo Wang,Yan Jia,Yi Han,Weihong Han
DOI: https://doi.org/10.1109/icde.2009.170
2009-01-01
Abstract:Nowadays, various learning technologies are required on uncertain data. As an important pre-processing step in data mining, feature selection needs to consider this vagueness or uncertainty. In this paper, we propose a novel algorithm to evaluate the correlation between features and uncertain class labels on the basis of Hilbert-Schmidt Independence Criterion. Consequently, the features can be ranked according to this criterion. Experimental results on extensive datasets demonstrate the benefits of our method.
What problem does this paper attempt to address?