Study on Continuous Speech Recognition based on Bottleneck Features for Lhasa-Tibetan Dialect

ZHOU Nan,ZHAO Yue,LI Yaoqiang,XU Xiaona,CAIWANG Lamu,WU Licheng
2017-01-01
Abstract:At present, deep neural network has been widely used in speech recognition, although it has high robustness and semantic distinction, but its posterior features cannot be used for GMM-HMM acoustic modeling framework. However, the neural network with a narrow bottleneck can solve this problem, and its bottleneck features not only have long term context-dependence and compact representation of speech signal, but also can replace the traditional MFCC features for GMM-HMM acoustic modeling. In this paper, we study on applying bottleneck features and its concatenated features with MFCC into Lhasa-Tibetan continuous speech recognition. The experimental results show that the concatenated features of bottleneck features and MFCC achieved better performance than the posterior features of deep neural network and mono-bottleneck features.
What problem does this paper attempt to address?