Use Formant Trajectory to Improve the Performance of Mandarin Digit Speech Recognition

LI Husheng,YANG Mingjie,LIU Runsheng
DOI: https://doi.org/10.3321/j.issn:1000-0054.1999.09.019
1999-01-01
Abstract:In mandarin digit speech recognition (MDSR), “2” and “8” are the most confusable pair of words. The reason why “2” and “8” are often confused is analyzed. It is found that the cue to distinguish “2” and “8” is the difference between the formant trajectory of “2” and “8”. Therefore the formant trajectory based on decision algorithm (FTBD) was proposed to distinguish “2” and “8”. Experiments show that with FTBD the correct recognition rate is improved from 96.0% to 97.7% for MDSR, and from 91% to 99% for “2” and “8”, thus this confusion is removed from MDSR, and the performance of MDSR is improved.
What problem does this paper attempt to address?