Training Deep Belief Network with Sparse Hidden Units.

Zhen Hu,Wenzheng Hu,Changshui Zhang
DOI: https://doi.org/10.1007/978-3-662-45646-0_2
2014-01-01
Abstract:In this paper, we proposed a framework to train Restricted Boltzmann Machine (RBM) which is the basic block for Deep Belief Network (DBN). By introducing sparsity constraint to the Contrastive Divergence algorithm (CD algorithm), we trained RBMs with better performance than the off-the-shelf model in MNIST handwritten digit data set. The sparse model suffer from saturation slightly, however, by using a trade-off coefficient, the saturation problem can be solved well. To our knowledge, the sparsity constraint was first introduced to the hidden units of RBM.
What problem does this paper attempt to address?