Improving Levenberg-Marquardt Algorithm for Neural Networks

Omead Pooladzandi,Yiming Zhou
DOI: https://doi.org/10.48550/arXiv.2212.08769
2022-12-17
Abstract:We explore the usage of the Levenberg-Marquardt (LM) algorithm for regression (non-linear least squares) and classification (generalized Gauss-Newton methods) tasks in neural networks. We compare the performance of the LM method with other popular first-order algorithms such as SGD and Adam, as well as other second-order algorithms such as L-BFGS , Hessian-Free and KFAC. We further speed up the LM method by using adaptive momentum, learning rate line search, and uphill step acceptance.
Machine Learning
What problem does this paper attempt to address?