Semantic model for Chinese phoneme-to-character transcription

Jianping Zhang, Zuoying Wang
IF: 1.019
1999-01-01
Chinese Journal of Electronics
Abstract: This paper applies a semantic model to decoding Chinese syllable sequences into character sequences. The model achieves a character accuracy of 97.82%, compared with 93.2% for a traditional bigram language model and 95.1% for a POS+bigram model. A hybrid model that mixes a trigram model with the semantic model is also investigated; it achieves a perplexity of 32.07, lower than the perplexity of 36.0 reported in [2]. Finally, traditional word-based trigram models are compared with refined word-based trigram models, in which each meaning of a word is treated as a separate entry. Both yield a similar word accuracy of 95.5%.
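The abstract does not say how the trigram and semantic models are mixed or how perplexity was computed. As a minimal sketch, assuming a simple linear interpolation of per-token probabilities and the standard definition of perplexity, the comparison could look like the following. The function names, the weight `lam`, and all probability values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math

def interpolate(p_trigram, p_semantic, lam):
    """Linear mixture of two per-token probabilities (assumed form, not from the paper)."""
    return lam * p_trigram + (1.0 - lam) * p_semantic

def perplexity(token_probs):
    """Perplexity: exp of the average negative log-probability per token."""
    n = len(token_probs)
    return math.exp(-sum(math.log(p) for p in token_probs) / n)

# Toy per-token probabilities, invented for illustration only.
trigram_probs = [0.10, 0.02, 0.05, 0.01]
semantic_probs = [0.04, 0.08, 0.02, 0.06]

# Mix the two models with an equal weight (hypothetical choice of lam).
mixed_probs = [
    interpolate(t, s, lam=0.5) for t, s in zip(trigram_probs, semantic_probs)
]

ppl_trigram = perplexity(trigram_probs)
ppl_semantic = perplexity(semantic_probs)
ppl_mixed = perplexity(mixed_probs)

print(f"trigram:  {ppl_trigram:.2f}")
print(f"semantic: {ppl_semantic:.2f}")
print(f"mixed:    {ppl_mixed:.2f}")
```

In this toy data the two models are weak on different tokens, so the mixture ends up with a lower perplexity than either model alone. That is the general motivation for a hybrid model, though it says nothing about the actual figures reported in the paper.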