Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling

Thomas Fang Zheng,Zhanjiang Song,Pascale Fung,William Byrne
DOI: https://doi.org/10.21437/icslp.2002-641
2002-01-01
Abstract:The multiple-pronunciation lexicon (MPL) is very important to model the pronunciation variations for spontaneous speech recognition. But the introduction of MPL brings out two problems. First, the MPL will increase the among-lexicon confusion and degrade the recognizer's performance. Second, the MPL needs more data with phonetic transcription so as to cover as many surface forms as possible. Accordingly, two solutions are proposed, they are the context-dependent weighting method and the iterative forced-alignment based transcription method. The use of them can compensate what the MPL causes and improve the overall performance. Experiments across a naturally spontaneous speech database show that the proposed methods are effective and better than other methods.
What problem does this paper attempt to address?