Feature Selection Algorithm for Uncertain Text Classification

王博,贾焰,杨树强,周斌
DOI: https://doi.org/10.3321/j.issn:1000-436x.2009.08.005
2009-01-01
Abstract: A novel feature selection algorithm called FSUNT was proposed, based on the Hilbert-Schmidt Independence Criterion (HSIC) and designed to take into account the vagueness and uncertainty of class labels during feature selection. For text data with fixed feature values and uncertain class labels, features were ranked according to the correlation between each feature and the uncertain class labels, as evaluated by HSIC. Experimental evaluation on a variety of datasets shows that FSUNT achieves better performance and stability.
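The ranking step described in the abstract can be illustrated with a minimal sketch. This is not the FSUNT implementation: the abstract does not specify kernels, estimator, or label representation, so everything below is an assumption. Uncertain labels are modelled as per-document class-probability vectors, both kernels are linear, and the standard biased empirical estimator tr(KHLH)/(n-1)^2 is used. The names `linear_kernel`, `center`, `hsic`, and `rank_features`, and the toy data, are hypothetical.

```python
from typing import List, Sequence, Tuple

Matrix = List[List[float]]


def linear_kernel(rows: Sequence[Sequence[float]]) -> Matrix:
    """Gram matrix of inner products between row vectors."""
    return [[sum(a * b for a, b in zip(u, v)) for v in rows] for u in rows]


def center(gram: Matrix) -> Matrix:
    """Double-center a symmetric Gram matrix: H K H, H = I - (1/n) 11^T."""
    n = len(gram)
    row_means = [sum(row) / n for row in gram]
    total_mean = sum(row_means) / n
    # The Gram matrix is symmetric, so column means equal row means.
    return [[gram[i][j] - row_means[i] - row_means[j] + total_mean
             for j in range(n)] for i in range(n)]


def hsic(k: Matrix, l: Matrix) -> float:
    """Biased empirical HSIC estimate: tr(K H L H) / (n - 1)^2."""
    n = len(k)
    kc, lc = center(k), center(l)
    trace = sum(kc[i][j] * lc[i][j] for i in range(n) for j in range(n))
    return trace / (n - 1) ** 2


def rank_features(x: Sequence[Sequence[float]],
                  soft_labels: Sequence[Sequence[float]]
                  ) -> Tuple[List[int], List[float]]:
    """Score each feature column by HSIC against the soft labels.

    Returns feature indices sorted by descending score, plus the scores.
    """
    label_gram = linear_kernel(soft_labels)
    n_features = len(x[0])
    scores = []
    for f in range(n_features):
        column = [[row[f]] for row in x]  # one feature as n x 1 vectors
        scores.append(hsic(linear_kernel(column), label_gram))
    order = sorted(range(n_features), key=lambda f: scores[f], reverse=True)
    return order, scores


# Toy data (assumed): 4 documents, 3 term features, 2 classes.
docs = [[3, 1, 1],
        [2, 1, 0],
        [0, 1, 0],
        [0, 1, 1]]
# Assumed representation of label uncertainty: class probabilities per document.
soft_labels = [[0.9, 0.1],
               [0.8, 0.2],
               [0.1, 0.9],
               [0.2, 0.8]]

order, scores = rank_features(docs, soft_labels)
print(order)
```

In this toy example the second feature is constant across documents, so its centered Gram matrix is zero and it receives a score of zero. With linear kernels the score reduces to the squared norm of the feature-label cross-covariance, so it is sensitive to feature scale; a Gaussian kernel or normalized features would be a reasonable alternative under different assumptions.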