Pronunciation scoring model for Mandarin Phonemes based on feature comparison using a simulated annealing genetic algorithm

YE Datian
2012-01-01
Abstract:A Mandarin Phoneme pronunciation scoring model was developed to help people with difficulty in pronunciation and people learning foreign languages to rectify pronunciation errors.The method uses feature comparison of the Mel frequency cepstrum coefficient(MFCC) and a simulated annealing genetic algorithm(SAGA).The dynamic time warping(DTW) algorithm is used to evaluate the phoneme similarity,and to automatically compute the scores for these phonemes based on the SAGA scoring mechanism.This paper compares phoneme scores using different optimization algorithms(SAGA and local optimization) and different DTW algorithms.The results show that the SAGA model accuracy is better than 94%,significantly better than the local-optimization model.Moreover,the combination of SAGA and the improved DTW algorithm with a parallelogram search path resulted in the best pronunciation score.Thus,the model based on the MFCC and SAGA methods is appropriate for pronunciation scoring of Mandarin Phonemes.
What problem does this paper attempt to address?