Batch Gradient Training Method with Smoothing Group L_0 Regularization for Feedfoward Neural Networks

Ying Zhang,Jianing Wei,Dongpo Xu,Huisheng Zhang
DOI: https://doi.org/10.1007/s11063-022-10956-w
IF: 2.565
2022-01-01
Neural Processing Letters
Abstract:L_0 regularization is an ideal pruning method for neural networks as it can generate the sparsest results of all L_p regularization method. However, the solving of L_0 regularization is an NP-hard problem, and the existing training algorithm with L_0 regularization can only prune the networks weights, but not neurons. To this end, in this paper we propose a batch gradient training method with smoothing Group L_0 regularization ( BGSGL_0 ). BGSGL_0 not only overcomes the NP-hard nature of the L_0 regularizer, but also prunes the network from the neuron level. The working mechanism for BGSGL_0 to prune hidden neurons is analysed, and the convergence is theoretically established under mild conditions. Simulation results are provided to validate the theoretical finding and the the superiority of the proposed algorithm.
What problem does this paper attempt to address?