Acoustic Model Reconstruction for Multi-Accent Chinese Speech Recognition

Zheng Thomas
2011-01-01
Abstract:The acoustic likelihood score is used as a confidence measure to generate reliable accent-specific units and to merge such reliable accent-specific units through acoustic model reconstruction.The decision tree merge and acoustic model reconstruction efficiencies are improved by reducing redundant Gaussian components through an incremental decision tree merge procedure and selection of Gaussian components according to their dominance.Tests on Cantonese and Wu accents show that this approach yields significant 9.25% and 9.21% absolute syllable error rate(SER) reductions without degrading the performance on standard Putonghua.
What problem does this paper attempt to address?