Automatic Mispronunciation Detection for Mandarin Chinese

ZHANG Feng,HUANG Chao,DAI Lirong
DOI: https://doi.org/10.3969/j.issn.1003-0077.2010.02.015
2010-01-01
Abstract:The current automatic mispronunciation detection systems are mostly based on automatic speech recognition(ASR) framework with statistical model.This paper presents the methods to improve the performance of mispronunciation detection at syllable level for Mandarin Chinese from two aspects: introducing the speaker adaptive training(SAT) and the selective maximum likelihood linear regression(SMLLR) to get a better acoustic statistical model,and proposing speaker normalization backend because of the limited information and the different rating level for the different pronunciation level.Experiments on a database of 8000 syllables pronounced by 40 speakers with varied pronunciation proficiency indicate the promising effects of these strategies by improving the precision from 45.8% to 53.6% at 30% recall,and 64.6% to 79.9% at 10% recall.
What problem does this paper attempt to address?