Incorporating Latent Semantic Indexing into Spectral Graph Transducer for Text Classification.

Xinyu Dai,Baoming Tian,Junsheng Zhou,Jiajun Chen
2008-01-01
Abstract:Spectral Graph Transducer(SGT) is one of the superior graph-based transductive learning methods for classi- fication. As for the Spectral Graph Transducer algo- rithm, a good graph representation for data to be pro- cessed is very important. In this paper, we try to in- corporate Latent Semantic Indexing(LSI) into SGT for text classification. Firstly, we exploit LSI to represent documents as vectors in a latent semantic space since we propose that the documents and their semantic rela- tionships can be reflected more pertinently in this latent semantic space. Then, a graph needed by SGT is con- structed. In the graph, a node corresponds to a vector from LSI. Finally, we apply the graph to Spectral Graph Transducer for text classification. The experiments gave us excellent results on both English and Chinese text classification datasets and demonstrated the validation of our assumption.
What problem does this paper attempt to address?