Application of Group Delay Spectrum Parameters in Mandarin Digit Speech Recognition

Feng ZHOU,Yi-biao YU
DOI: https://doi.org/10.16798/j.issn.1003-0530.2017.09.008
2017-01-01
Journal of Signal Processing
Abstract:The high confusion between Chinese digits directly affects the performance of Chinese digit speech recognition.Traditional methods are difficult to make an effective distinction between easy-confused digits.This paper presents a multiparameter and multi-level recognition strategy.Firstly the digits are recognized by Mel spectral parameters based on HMM,then take secondary classification for the easy-confused digits using RRCGD-CC (Reflected Roots Chirp Group Delay-Cepstral Coefficients),which is a new parameter based on group delay spectrum,and SVM.The experimental results show that the recognition rate of "2" and "8" is improved by 8%,and the recognition rate of the system is improved by 2.3%.This result is fully explained that the RRCGD-CC is valid for easily confused digits.
What problem does this paper attempt to address?