Application of a non-linear dimension reduction algorithm on document clustering

SUN Yue-heng,HOU Yue-xian,HE Pi-lian
DOI: https://doi.org/10.3724/sp.j.1087.2008.00488
2008-01-01
Journal of Computer Applications
Abstract:This paper presented a non-linear dimension reduction algorithm-Self-organizing Isometric Embedding(SIE)to compress high-dimensional document data.The algorithm was then validated in document clustering by being compared with the typical linear dimension reduction algorithm-Latent Semantic Indexing(LSI).Experimental results show that while significantly lowering the complexity,the performance of SIE is better than that of LSI and the benchmark.
What problem does this paper attempt to address?