Research on speech recognition models in the Chinese dictation machine

Fang Zheng, Wenhu Wu, Ditang Fang
1997-01-01
Abstract: The speech recognition model and the language model are two critically important components of a Chinese dictation machine. The performance of the speech model directly affects that of the language model and of the dictation machine as a whole. A large number of experiments on speech recognition units, speech recognition models, and scoring methods for output observation vectors were carried out on a very large speech corpus. The results show that the best performance is achieved by choosing the syllable as the speech recognition unit, using the CDCPM (center-distance continuous probability model) based on the CDN (center-distance normal) distribution, and adopting the NN (nearest neighbor)-based scoring scheme, i.e., the embedded multi-model (EMM) scheme.