LSASGT:an Approach to Text Categorization Based on Latent Semantic Analysis and Spectral Graph Transducer

戴新宇,田宝明,周俊生,陈家骏
DOI: https://doi.org/10.3321/j.issn:0372-2112.2008.08.027
2008-01-01
Abstract:In this paper,an approach to text categorization named LSASGT is proposed,which combines Latent Semantic Analysis(LSA) with Spectral Graph Transducer(SGT) for the task of text categorization.For both LSA and SGT are originated from spectral analysis theory which can mine some latent structure information within all training and testing data,we integrate them tightly in one model.Firstly,according to the characteristic of natural language,LSA is used to represent documents in a latent semantic space in which documents and their semantic relationships can be reflected more pertinently.Then we construct a graph based on the latent concept-based subspace,and apply the graph into SGT for text categorization.The experiments demonstrate that LSASGT can improve classification performance on both English and Chinese datasets of Reuters21578 and TanCorp-12.
What problem does this paper attempt to address?