A two-stage mispronunciation detection approach for computer-assisted pronunciation training

Hua Yuan,Junhong Zhao,Jia Liu
2011-01-01
Abstract:In this paper, we propose a two-stage mispronunciation detection approach for computer-assisted pronunciation training. In the first stage, the selected phonological rules are used to cooperate with ASR to detect mispronunciations based on language transfer. Because the first stage detection can only deal with the pronunciation errors in the scope of the phonological rules, and detection performance is depressed with the imperfect phoneme acoustic model. The rescoring method based on duration normalized log posterior probability (NLPP) is employed in the second stage to identify the recognition speech unit again. Furthermore, a new F α-score ranking criterion is proposed for the first stage to balance the mispronunciation coverage and recognition confusion, in the aim of minimizing the cost of total detection errors. The experiment shows that the method only with phonological rules gets a best performance of 19991 total detection errors, and the normalized log posterior probability method costs 22264 total errors. Finally, the two-stage detection approach can reduce the total errors to 19498.
What problem does this paper attempt to address?