High Performance Digit Mandarin Speech Recognition

李虎生,刘加,刘润生
DOI: https://doi.org/10.3321/j.issn:0372-2112.2001.05.005
2000-01-01
Abstract:High performance mandarin digit speech recognition (MDSR) system is developed using MFCC (mel frequency cepstrum coefficient) as the main parameter identifying the speech patterns. The formant trajectory and the nasal feature are extracted to identify confused words. A feature based, real time endpoint detection algorithm is proposed to reduce the system resource requirements and to improve the disturbance proof ability. A two stage recognition frame enhances discrimination by identifying candidate words in the first stage and confused word pairs in the second stage. These improvements result in a correct recognition rate of 98.8%.
What problem does this paper attempt to address?