Zero-probabilities of language model in translations of Chinese spellings to characters

Ruiqiang Zhang,Zuoying Wang,Dajun Lu
1998-01-01
Tien Tzu Hsueh Pao/Acta Electronica Sinica
Abstract:This paper deals with the problem of language model in translation of Chinese spelling to characters. From the point of perplexity of language model, this paper discusses the efficiency of three kinds of methods on probabilities estimation of sparse data, which are back-off, deleted interpolation and nonlinear interpolation. Moreover, an iterative formula of parameters to get the minimum perplexity of language model under the three methods is proposed and proved by experiment.
What problem does this paper attempt to address?