Speech recognition adaptive clustering feature extraction algorithms based on the k-means algorithm and the normalized intra-class variance

Xi Xiao,Lu Zhou
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2017.22.050
2017-01-01
Abstract:The inter-frame independence assumption for speech recognition simplifies the computations. However, it also reduces the model accuracy and can easily give rise to recognition errors. Therefore, the objective of this paper is to search for a feature which can weaken the inter-frame dependence of the speech features and keep as much information of the original speech as possible. Two speech recognition feature extraction algorithms are given based on the k-means algorithm and the normalized intra-class variance. These algorithms provide adaptive clustering feature extraction. Speech recognition tests with these algorithms on a Gaussian mixture model-hidden Markov model(GMM-HMM), a duration distribution based HMM (DDBHMM), and a context dependent deep neural network HMM (CD-DNN-HMM)show that the adaptive feature based on the normalized intra-class variance reduces the relative recognition error rates by 10. 53%, 5. 17%, and 2. 65%relative to the original features. Thus, this adaptive clustering feature extraction algorithm provides improved speech recognition.
What problem does this paper attempt to address?