Easytalk: a large-vocabulary speaker-independent Chinese dictation machine

Fang Zheng,Zhanjiang Song,Mingxing Xu,Jian Wu,Yinfei Huang,Wenhu Wu,Cheng Bi
DOI: https://doi.org/10.21437/eurospeech.1999-199
1999-01-01
Abstract:The EasyTalk application is a large-vocabulary speaker-independent continuous Chinese speech recognition system, i.e. Chinese dictation machine (CDM), under the WINTEL environment. Addressed in this paper are a number of novel techniques adopted in the CDM engine which is the basis of EasyTalk, including the merging-based syllable detection automaton (MBSDA) and the statistical knowledge based frame synchronous search (SKB-FSS) algorithms in the acoustic processing stage, the percentage in critical area (CAP) and recognition score gap (RSG) methods for the acceptation and rejection decision, the word search tree (WST), the N-Gram, and the syllable synchronous network search (SSNS) algorithm in the language processing stage, the embedded multiple model sheme (EMM) and the fuzzy syllable set (FSS) for the robustness purpose.
What problem does this paper attempt to address?