Topic Classification of Chinese Document Based on NMF

张磊,冯晓森,项学智
DOI: https://doi.org/10.3969/j.issn.1000-3428.2009.13.009
2009-01-01
Abstract:This paper presents a method based on Non-negative Matrix Factorization(NMF) for Chinese document topic classification.According to NMF, the term-document matrix is decomposed to reveal the relationship between terms.This method solves the problem of synonym and polysemy effectively.Compared with Latent Semantic Indexing(LSI) based on Singular Value Decomposition(SVD), experimental results show that this method has faster computing speed and less memory occupancy.It can improve classification precision when the number of latent semantic index is reduced pronouncedly.
What problem does this paper attempt to address?