Language Model Adaptation Based on the Classification of a Trigram's Language Style Feature

Qi LIANG, Fang ZHENG, Ming-xing XU, Wen-hu WU
DOI: https://doi.org/10.3969/j.issn.1003-0077.2006.04.010
2006-01-01
Abstract: This paper proposes a language-style-based adaptation method for language models that exploits the differences between spoken and written language. Several interpolation methods based on trigram counts are used for the adaptation. An interpolation method that takes Katz smoothing into account computes weights according to the confidence score of a trigram. An adaptation method based on classifying a trigram's style feature computes weights dynamically according to the trigram's language style tendency, using several proposed weight generation functions. Experiments on spoken Chinese corpora show that these methods reduce the Chinese character error rate for pinyin-to-character conversion to varying degrees; the method that considers both a trigram's confidence and its style tendency achieved the best performance, with character error rate reductions of 50.2% and 23.7%, respectively, relative to the two baselines in this paper.
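The idea of weighting each trigram by its style tendency can be illustrated with a minimal Python sketch. The abstract does not give the paper's weight generation functions or its confidence score, so everything here is an assumption made for illustration: the names `style_tendency`, `weight`, `cond_prob`, and `adapted_score` are hypothetical, the tendency is taken to be a log-ratio of add-one relative frequencies, the weight function is a logistic curve, and the component models are plain maximum-likelihood estimates rather than Katz-smoothed ones.

```python
import math
from collections import Counter


def style_tendency(tri, spoken, written):
    """Log-ratio of add-one relative frequencies (assumed measure).

    Positive values mean the trigram leans toward the spoken corpus.
    """
    n_spoken = sum(spoken.values())
    n_written = sum(written.values())
    p_spoken = (spoken[tri] + 1) / (n_spoken + 1)
    p_written = (written[tri] + 1) / (n_written + 1)
    return math.log(p_spoken / p_written)


def weight(tendency, sharpness=1.0):
    """Hypothetical weight generation function: a logistic curve in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-sharpness * tendency))


def cond_prob(tri, counts):
    """Maximum-likelihood P(w3 | w1, w2) from raw trigram counts."""
    context_total = sum(c for t, c in counts.items() if t[:2] == tri[:2])
    return counts[tri] / context_total if context_total else 0.0


def adapted_score(tri, spoken, written):
    """Interpolate the two models with a per-trigram, style-dependent weight.

    Because the weight changes from trigram to trigram, the scores for one
    context need not sum to 1; a real model would renormalize them.
    """
    lam = weight(style_tendency(tri, spoken, written))
    return lam * cond_prob(tri, spoken) + (1.0 - lam) * cond_prob(tri, written)
```

Calling `adapted_score(("i", "want", "to"), spoken, written)` with two `Counter` objects keyed by word triples returns the interpolated score. A trigram seen mostly in the spoken corpus receives a weight above 0.5 on the spoken model.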