Improve Mispronunciation Detection with Tandem Feature

Hua Yuan, Junhong Zhao, Jia Liu
DOI: https://doi.org/10.1109/iscslp.2012.6423538
2012-01-01
Abstract: This paper presents a method to improve mispronunciation detection performance for low-resource acoustic models. One hour of speech data is randomly selected from CU-CHLOE to simulate a low-resource non-native English condition. A Tandem feature derived from an articulatory-based Multi-Layer Perceptron (MLP) is employed to replace the traditional spectral feature (e.g. PLP). Further, motivated by the similar pronunciation characteristics of Mandarin and English spoken by Chinese speakers, Mandarin speech data is used to assist in training multilingual articulatory MLPs. The Tandem feature is also combined with PLP to improve performance. With the proposed method, phone recognition correctness (CORR) is improved by 3.84% and diagnosis accuracy (DA) is improved by 2.25%.
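The combination of a Tandem feature with PLP can be illustrated with a minimal sketch. It follows the generic Tandem recipe (log of MLP posteriors, PCA decorrelation, frame-wise concatenation with the spectral feature); the abstract does not state the exact pipeline, so the log and PCA steps, the function name `tandem_features`, the dimensions, and the random stand-in data are all assumptions made for illustration.

```python
import numpy as np

def tandem_features(posteriors, plp, n_components):
    """Hypothetical helper: build Tandem+PLP features per frame.

    posteriors: (T, K) MLP output posteriors for T frames.
    plp:        (T, D) spectral features for the same frames.
    """
    # Log compresses the heavily skewed posterior distribution.
    log_post = np.log(np.clip(posteriors, 1e-10, None))
    # PCA: project centered log-posteriors onto the top eigenvectors
    # of their covariance matrix to decorrelate and reduce dimension.
    centered = log_post - log_post.mean(axis=0)
    cov = np.cov(centered, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:n_components]
    projected = centered @ eigvecs[:, order]
    # Frame-wise concatenation of the Tandem part with PLP.
    return np.hstack([projected, plp])

# Random stand-in data (assumed sizes, not taken from the paper).
rng = np.random.default_rng(0)
T, K, D = 200, 20, 39
logits = rng.normal(size=(T, K))
posteriors = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
plp = rng.normal(size=(T, D))
features = tandem_features(posteriors, plp, n_components=10)
```

In this sketch the first 10 columns of `features` carry the decorrelated MLP information and the remaining 39 columns are the unchanged PLP vector, so a downstream acoustic model sees both streams in each frame.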