Mandarin accent identification based on GMM with multi-feature fusion

Pang Cheng,Wang Xiuling,Zhang Jie,Liu Hong
DOI: https://doi.org/10.13245/j.hust.15S1090
2015-01-01
Abstract:A method based on Mel frequency cepstrum coefficients (MFCC) and formant frequency for Mandarin accent identification was proposed. Firstly, MFCC and formant frequency features were extracted for the Gaussian mixture model (GMM), which was trained by the expectation-maximization (EM) algorithm. Then, these two features were modeled. Finally, the information fusion strategy based on maximum likelihood (ML) criteria was utilized to make the final decision. The corpus consists of speech data from seven districts. After cross-validation, the results indicate that the recognition rate can reach 85.61% for the typical parts of Mandarin-speaking areas in China. Compared with the approaches of using MFCC or formant frequency features, our method increases by 6.62% and 32.90%, respectively. ©, 2015, Huazhong University of Science and Technology. All right reserved.
What problem does this paper attempt to address?