Text Categorization Based on Macro Feature Fusion

Dandan WANG,Qingcai CHEN,Xiaolong WANG,Buzhou TANG
2017-01-01
Abstract:Macro feature extraction methods are a typical feature extraction methods for text categorization.These methods fall into two categories:supervised macro feature extraction and unsupervised macro feature extraction.In this paper,we study the effect of the fusion of the two categories of macro features,which are both proved positive to text categorization.In particular,two types of supervised macro features and three types of unsupervised macro features are taken into account.Experiments conducted on three corpora,including two public corpora (i.e.,Reuters-21578 and 20-Newsgroup) and one automatically constructed corpus,show that the fusion of supervised and unsupervised macro features is more effective than using any of them individually.
What problem does this paper attempt to address?