Deep belief networks with self-adaptive sparsity

Chen Qiao, Lan Yang, Yan Shi, Hanfeng Fang, Yanmei Kang
DOI: https://doi.org/10.1007/s10489-021-02361-y
IF: 5.3
2021-04-26
Applied Intelligence
Abstract: Sparsity is crucial for deep neural networks because it improves their learning ability, especially on high-dimensional data with small sample sizes. Commonly used regularization terms for enforcing sparsity in deep neural networks are based on the L1-norm or L2-norm; however, these are not the most reasonable substitutes for the L0-norm. In this paper, based on the fact that minimizing a log-sum function is an effective approximation to minimizing the L0-norm, a sparse penalty term on the connection weights using the log-sum function is introduced. By embedding the corresponding iterative re-weighted L1 minimization algorithm into k-step contrastive divergence, the connections of deep belief networks can be updated in a sparse, self-adaptive way. Experiments on two kinds of biomedical datasets, both typical small-sample-size datasets with a large number of variables, namely brain functional magnetic resonance imaging data and single nucleotide polymorphism data, show that the proposed deep belief networks with self-adaptive sparsity can learn layer-wise sparse features effectively. The results also demonstrate better performance, in both identification accuracy and sparsity capability, than several typical learning machines.
computer science, artificial intelligence
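The idea summarized in the abstract, a log-sum penalty handled by iterative re-weighted L1 minimization inside k-step contrastive divergence, can be illustrated with a minimal NumPy sketch for a single Bernoulli restricted Boltzmann machine. This is not the authors' implementation: the function names, the hyperparameters (`lr`, `lam`, `eps`), and the choice of a per-weight soft-threshold step as the re-weighted L1 update are all assumptions made for illustration, and the paper's actual update rule may differ.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def log_sum_penalty(W, eps=0.1):
    """Log-sum sparsity surrogate: sum_ij log(1 + |w_ij| / eps)."""
    return float(np.sum(np.log1p(np.abs(W) / eps)))


def reweighted_l1_step(W, grad, lr, lam, eps=0.1):
    """Gradient ascent step followed by a per-weight soft threshold.

    Linearising the log-sum penalty at the current W gives each weight its
    own L1 coefficient 1 / (|w_ij| + eps), so small weights are shrunk
    harder than large ones (assumed form of the re-weighted update).
    """
    thresh = lr * lam / (np.abs(W) + eps)
    W_new = W + lr * grad
    return np.sign(W_new) * np.maximum(np.abs(W_new) - thresh, 0.0)


def cd_k_gradient(W, b, c, v0, k, rng):
    """CD-k estimate of the log-likelihood gradient for a Bernoulli RBM."""
    ph0 = sigmoid(v0 @ W + c)
    h = (rng.random(ph0.shape) < ph0).astype(float)
    for _ in range(k):  # k steps of alternating Gibbs sampling
        pv = sigmoid(h @ W.T + b)
        v = (rng.random(pv.shape) < pv).astype(float)
        ph = sigmoid(v @ W + c)
        h = (rng.random(ph.shape) < ph).astype(float)
    n = v0.shape[0]
    dW = (v0.T @ ph0 - v.T @ ph) / n
    db = (v0 - v).mean(axis=0)
    dc = (ph0 - ph).mean(axis=0)
    return dW, db, dc


def train_sparse_rbm(X, n_hidden, k=1, lr=0.05, lam=0.01, eps=0.1,
                     epochs=50, seed=0):
    """Train one RBM layer with CD-k plus the re-weighted L1 step on W."""
    rng = np.random.default_rng(seed)
    W = 0.01 * rng.standard_normal((X.shape[1], n_hidden))
    b = np.zeros(X.shape[1])
    c = np.zeros(n_hidden)
    for _ in range(epochs):
        dW, db, dc = cd_k_gradient(W, b, c, X, k, rng)
        W = reweighted_l1_step(W, dW, lr, lam, eps)
        b += lr * db
        c += lr * dc
    return W, b, c


if __name__ == "__main__":
    # Synthetic binary data standing in for a small-sample, high-dimensional set.
    data_rng = np.random.default_rng(1)
    X = (data_rng.random((20, 100)) < 0.3).astype(float)
    W, b, c = train_sparse_rbm(X, n_hidden=16)
    print("fraction of zero weights:", np.mean(W == 0.0))
    print("log-sum penalty:", log_sum_penalty(W))
```

Because the threshold is recomputed from the current weights at every step, the amount of shrinkage adapts per connection, which is one way to read the "self-adaptive" sparsity described in the abstract. Stacking several such layers would give a deep belief network; that part is omitted here.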