Score Fusion For Perceptual Evaluation Of Pronunciation Quality

Chaolei Li, Jia Liu, Change Liu, Shanhong Xia
2006-01-01
Abstract: We propose an intelligent computation model for the perceptual evaluation of pronunciation quality in Computer Assisted Language Learning (CALL). An acoustic model and a psychoacoustic model are combined to simulate the process by which a human expert evaluates the pronunciation quality of speech. The model yields three scores: the matching score, which indicates the acoustic distortion; the perceptual score, which indicates the distortion perceived by a human listener in the perception domain; and the asymmetric score, which indicates the asymmetric effect that deletion errors and insertion errors in spoken English have on the listener's sensation. We then investigate the fusion of these scores using linear regression, a neural network and a Support Vector Machine (SVM). The best correlation, 0.78, is obtained with SVM-based score fusion, which compares favorably with current methods.
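As an illustration of the score-fusion idea, the following is a minimal sketch of the simplest of the three fusion methods named in the abstract, linear regression. It is not the authors' implementation: the function names (`fit_linear_fusion`, `fuse`, `pearson`), the train/test split, and above all the data are assumptions. The three score columns and the reference ratings are randomly generated stand-ins, so the printed correlation says nothing about the paper's results.

```python
import numpy as np


def fit_linear_fusion(scores, ratings):
    """Fit least-squares weights (plus a bias term) mapping scores to ratings."""
    X = np.column_stack([scores, np.ones(len(scores))])  # append bias column
    weights, *_ = np.linalg.lstsq(X, ratings, rcond=None)
    return weights


def fuse(scores, weights):
    """Apply fitted weights to produce one fused score per utterance."""
    X = np.column_stack([scores, np.ones(len(scores))])
    return X @ weights


def pearson(a, b):
    """Pearson correlation coefficient between two 1-D arrays."""
    return float(np.corrcoef(a, b)[0, 1])


# Synthetic stand-in data (assumption): one row per utterance, with columns
# playing the roles of the matching, perceptual and asymmetric scores.
rng = np.random.default_rng(0)
n = 200
scores = rng.normal(size=(n, 3))
# Hypothetical reference ratings: an arbitrary mix of the scores plus noise.
ratings = scores @ np.array([0.5, 0.3, 0.2]) + rng.normal(scale=0.5, size=n)

# Fit on one part of the data, evaluate on the held-out part.
train, test = slice(0, 150), slice(150, n)
weights = fit_linear_fusion(scores[train], ratings[train])
fused_test = fuse(scores[test], weights)

print("held-out correlation:", pearson(fused_test, ratings[test]))
```

On the data used for fitting, the fused score correlates with the ratings at least as strongly as any single score does, which is the basic motivation for fusing the scores rather than relying on one. A neural network or an SVM regressor would take the place of `fit_linear_fusion` and `fuse` in the same pipeline.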