A new latent semantic analysis language model

Jisheng Ren,Zuoying Wang
DOI: https://doi.org/10.3321/j.issn:1002-0470.2005.08.001
2005-01-01
Abstract:Latent semantic analysis automatically uncovered the salient semantic relationships between words in a given training corpus by a novel faster method for quantizing word via clustering, it was used for mandarin speech recognition through combining with trigram model via a new proposed static geometric weighting interpolation manner. Experiments show that it outperformed the traditional singular value decomposition-based latent semantic analysis model for its better efficiency and performance. Compared with the trigram model, the reduction of relative recognition error rate is about 3.6%-7.1%. Furthermore, it provides a novel approach for improving latent semantic analysis model through quantizing word pair effectively.
What problem does this paper attempt to address?