RESEARCH AND REALISATION OF TANDEM IN MANDARIN SPEECH ERROR DETECTION SYSTEM

Gong Shu,Wei Si,Hu Guoping,Liu Qingfeng
DOI: https://doi.org/10.3969/j.issn.1000-386X.2011.07.071
2011-01-01
Abstract:The HMM/GMM framework which is often used in voice recognition is comparatively poor at mode recognition due to its limitation on training rule and algorithm.Another framework,the HMM/ANN,is better at mode classification but lacks mature and effective optimisation approaches.In the thesis,the authors apply TANDEM which holds the advantages of both frameworks to the mandarin speech error detection system.At the beginning a discriminatingly training neural network estimates rhythm level pro-check probability,then with a series of processes converts the original MFCC feature to TANDEM feature as the input to the HMM statistical model based speech error detection system,consequently completes the evaluation process.Experiment shows the TANDEM approach can greatly improve the error detection performance of the system.It works even better when cooperates with an adaptive approach such as MLLR.
What problem does this paper attempt to address?