Investigation of Tandem Deep Belief Network Approach for Phoneme Recognition

Xin Zheng, Zhiyong Wu, Binbin Shen, Helen Meng, Lianhong Cai
DOI: https://doi.org/10.1109/icassp.2013.6639138
2013-01-01
Abstract: This paper proposes a tandem DBN approach, a hierarchical architecture that consists of two or more deep belief networks (DBNs) arranged in tandem, for the phoneme recognition task on TIMIT. First, we describe the standard DBN approach to phoneme recognition and discuss the motivation for combining it with the tandem classifier approach. We then perform a series of experiments to find the best configuration for the second-level DBN and to explore the full potential of this method. The experiments show that, for the second-level DBN, (a) 2048 units in each hidden layer perform better than 1024 or 512 units, (b) given a sufficiently long temporal context, two hidden layers are better, and (c) the configuration that performs best on the development set yields a 4% relative improvement on the core test set.
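The tandem arrangement described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the networks below are randomly initialised feed-forward stand-ins for trained DBNs, and the feature dimension, number of output classes, context widths, first-level layer sizes, and the choice to pass raw posteriors (rather than, say, log-posteriors) to the second level are all assumptions made for illustration. Only the second-level choice of two hidden layers with 2048 units each is taken from the abstract.

```python
import numpy as np

def init_mlp(sizes, rng):
    """Random weights for a feed-forward net; a stand-in for a trained DBN."""
    return [(rng.standard_normal((m, n)) * 0.01, np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def forward(x, layers):
    """Sigmoid hidden layers followed by a softmax output layer."""
    for W, b in layers[:-1]:
        x = 1.0 / (1.0 + np.exp(-(x @ W + b)))
    W, b = layers[-1]
    z = x @ W + b
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def stack_context(frames, half_width):
    """Concatenate each frame with its neighbours; edge frames are repeated as padding."""
    T = frames.shape[0]
    padded = np.pad(frames, ((half_width, half_width), (0, 0)), mode="edge")
    return np.hstack([padded[i:i + T] for i in range(2 * half_width + 1)])

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration (not from the paper).
T, feat_dim, n_classes = 50, 39, 61
ctx1, ctx2 = 5, 5            # frames of context on each side, per level
hidden1 = 1024               # assumed first-level hidden size
hidden2 = 2048               # second-level size reported as best in the abstract

level1 = init_mlp([feat_dim * (2 * ctx1 + 1), hidden1, hidden1, n_classes], rng)
level2 = init_mlp([n_classes * (2 * ctx2 + 1), hidden2, hidden2, n_classes], rng)

features = rng.standard_normal((T, feat_dim))          # placeholder acoustic features
post1 = forward(stack_context(features, ctx1), level1)  # first-level posteriors
post2 = forward(stack_context(post1, ctx2), level2)     # second level sees a window of post1
```

The point of the sketch is the data flow: the second network classifies each frame from a temporal window of the first network's outputs instead of from the acoustic features directly.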