Cluster-Based Senone Selection for the Efficient Calculation of Deep Neural Network Acoustic Models

Jun-Hua Liu, Zhen-Hua Ling, Si Wei, Guo-Ping Hu, Li-Rong Dai
DOI: https://doi.org/10.1109/iscslp.2016.7918399
2016-01-01
Abstract: In this paper, we propose a cluster-based senone selection method to speed up the computation of deep neural networks (DNNs) at decoding time in speech recognition. In DNN-based acoustic models, the large number of senones at the output layer is one of the main causes of the high computational complexity of DNNs. Inspired by the mixture selection method designed for Gaussian mixture model (GMM)-based acoustic modeling, our proposed method selects only a subset of the senones at the output layer of the DNN and calculates posterior probabilities for that subset alone. The senone selection strategy is derived by clustering the acoustic inputs according to their linear outputs at the top hidden layer. Experimental results show that the number of senones that need to be calculated can be reduced by 63% without significant performance loss after applying our proposed method. As a result, the overall speed of the recognition process can be accelerated by 13% compared to the baseline.
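The abstract does not spell out the selection procedure, so the following is only a minimal sketch of how such a scheme might look, not the authors' implementation. It assumes plain k-means over top-hidden-layer vectors, a per-cluster shortlist made of the senones with the highest mean posterior on training frames, and a constant floor for unselected senones. All names (`kmeans`, `build_selection_table`, `selected_log_posteriors`, `keep`, `floor`) are hypothetical.

```python
import numpy as np

def assign(hidden, centroids):
    """Index of the nearest centroid (squared Euclidean) for each row."""
    d = ((hidden[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
    return d.argmin(1)

def kmeans(x, k, iters=20, seed=0):
    """Plain k-means; an assumed stand-in for the paper's clustering step."""
    rng = np.random.default_rng(seed)
    centroids = x[rng.choice(len(x), size=k, replace=False)]  # copy of k rows
    for _ in range(iters):
        labels = assign(x, centroids)
        for c in range(k):
            if np.any(labels == c):  # leave empty clusters where they are
                centroids[c] = x[labels == c].mean(0)
    return centroids

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def build_selection_table(hidden, W, b, centroids, keep):
    """Per cluster, keep the `keep` senones with the highest mean posterior.

    This ranking rule is an assumption; the abstract does not state it.
    """
    labels = assign(hidden, centroids)
    post = softmax(hidden @ W + b)
    table = []
    for c in range(len(centroids)):
        members = post[labels == c]
        mean_post = members.mean(0) if len(members) else post.mean(0)
        table.append(np.sort(np.argsort(mean_post)[-keep:]))
    return table

def selected_log_posteriors(h, W, b, centroids, table, floor=-20.0):
    """Compute output-layer scores only for the frame's shortlisted senones."""
    c = int(assign(h[None, :], centroids)[0])
    idx = table[c]
    logits = h @ W[:, idx] + b[idx]          # only len(idx) output columns
    m = logits.max()
    out = np.full(W.shape[1], floor)         # assumed floor for skipped senones
    out[idx] = logits - (m + np.log(np.exp(logits - m).sum()))
    return out, idx
```

In this sketch the output-layer product touches only `keep` of the weight columns per frame, which is where the saving would come from; normalizing over the shortlist alone is a further simplification.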