Comparison of Several Smoothing Methods in Statistical Language Model

Yang Liu,Jiasong Sun,Zuoying Wang
2000-01-01
Abstract:With the development of computer technology and the appearance of huge training text corpus, the performance of language model has improved a lot recently. But its intrinsic sparse data problem still exists. This paper investigates several smoothing methods in the application of Chinese continuous speech recognition. We compare the performance of different methods, particularly in the situation of pruned language model and conclude that the KneserNey strategy is better for the model without pruning while its performance decreases for the pruned language model.
What problem does this paper attempt to address?