A comprehensive unsupervised feature selection method of two-stage strategy

Lü Jing,Tong Ruofeng
DOI: https://doi.org/10.3969/j.issn.2095-2783.2011.04.004
2011-01-01
Abstract:Supervised text feature selection has made many extensive applications,and unsupervised has also gradually been focused on.This paper presents a comprehensive unsupervised feature selection method of two-stage,combining term contribution(TC) and column selection(CS).The method first removes features of not influential in global performance quickly,and then combining Column Selection method presents an objective function for selecting a subset from the rest of features,which can be solved by using the ideas of greedy and transductive experimental design,and finally obtains the streamline feature subset.The experimental results show that our proposed algorithms can outperforms many state-of-the-art methods on text clustering.
What problem does this paper attempt to address?