Crosslingual Acoustic Modeling in Uyghur Speech Recognition

NURMEMET Yolwas,LIU Junhua,WUSHOUR Silamu,REYIMAN Tursun,DAWEL Abilhayer
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2018.22.020
2018-01-01
Abstract:The Uyghur language has a little speech data for training acoustic models due to various data acquisition and annotation difficulties. This paper describes a modeling method for crosslingual acoustic models based on long short-term memory models. Mass Chinese language training data is used to train a deep neural network acoustic model. The network output layer weights are then randomly modified to create the output layer for the Uyghur language. A Uyghur language acoustic model is then trained using Uyghur language speech data to update all the weights. Tests show that this method reduces the word error rates of the Uyghur language transcription and dictation recognition by 20% and 30% than the baseline system. Thus, this method improves the Uyghur language acoustic model with better initial weights from the Chinese language data to train hidden layers in the neural network, and enhances the network robustness.
What problem does this paper attempt to address?