Use Tone Detection to Improve Performance of Mandarin Digit Speech Recognition

Runsheng Liu
1998-01-01
Abstract:Confusion between digits such as “2” and “8” has been the main error source in mandarin digit speech recognition (MDSR). Tone detection is introduced into MDSR to solve this problem. A series of methodologies for high performance pitch contour estimation are developed, including length varied average magnitude difference function (LVAMDF), vowel center location, multi period to single period pitch adjustment, pitch neighborhood searching, etc. The mandarin digit tone detection (MDTD) algorithm is then designed for MDSR tone detection. Experiments show that the new methodologies and algorithms increase MDSR correct recognition rate from 95.2% to 98.5% , and improve the correct recognition rate between digit “2” and “8” from 90.5% to 98.8%, thus basically remove this confusion from MDSR.
What problem does this paper attempt to address?