Convergence of a modified gradient-based learning algorithm with penalty for single-hidden-layer feed-forward networks
Jian Wang,Bingjie Zhang,Zhaoyang Sang,Yusong Liu,Shujun Wu,Quan Miao
DOI: https://doi.org/10.1007/s00521-018-3748-y
2018-09-29
Neural Computing and Applications
Abstract:Based on a novel algorithm, known as the upper-layer-solution-aware (USA), a new algorithm, in which the penalty method is introduced into the empirical risk, is studied for training feed-forward neural networks in this paper, named as USA with penalty. Both theoretical analysis and numerical results show that it can control the magnitude of weights of the networks. Moreover, the deterministic theoretical analysis of the new algorithm is proved. The monotonicity of the empirical risk with penalty term is guaranteed in the training procedure. The weak and strong convergence results indicate that the gradient of the total error function with respect to weights tends to zero, and the weight sequence goes to a fixed point when the iterations approach positive infinity. Numerical experiment has been implemented and effectively verifies the proved theoretical results.
computer science, artificial intelligence