Abstract:Abstract Deep Neural Networks (DNNs) are widely regarded as the most effective learning tool for dealing with large datasets, and they have been successfully used in thousands of applications in a variety of fields. Based on these large datasets, they are trained to learn the relationships between various variables. The adaptive moment estimation (Adam) algorithm, a highly efficient adaptive optimization algorithm, is widely used as a learning algorithm in various fields for training DNN models. However, it needs to improve its generalization performance, especially when training with large-scale datasets. Therefore, in this paper, we propose HN Adam, a modified version of the Adam Algorithm, to improve its accuracy and convergence speed. The HN_Adam algorithm is modified by automatically adjusting the step size of the parameter updates over the training epochs. This automatic adjustment is based on the norm value of the parameter update formula according to the gradient values obtained during the training epochs. Furthermore, a hybrid mechanism was created by combining the standard Adam algorithm and the AMSGrad algorithm. As a result of these changes, the HN_Adam algorithm, like the stochastic gradient descent (SGD) algorithm, has good generalization performance and achieves fast convergence like other adaptive algorithms. To test the proposed HN_Adam algorithm performance, it is evaluated to train a deep convolutional neural network (CNN) model that classifies images using two different standard datasets: MNIST and CIFAR-10. The algorithm results are compared to the basic Adam algorithm and the SGD algorithm, in addition to other five recent SGD adaptive algorithms. In most comparisons, the HN Adam algorithm outperforms the compared algorithms in terms of accuracy and convergence speed. AdaBelief is the most competitive of the compared algorithms. In terms of testing accuracy and convergence speed (represented by the consumed training time), the HN-Adam algorithm outperforms the AdaBelief algorithm by an improvement of 1.0% and 0.29% for the MNIST dataset, and 0.93% and 1.68% for the CIFAR-10 dataset, respectively.

Improving Levenberg-Marquardt Algorithm for Neural Networks

Adaptive Levenberg–Marquardt Algorithm: A New Optimization Strategy for Levenberg–Marquardt Neural Networks

An Improved Levenberg-Marquardt Algorithm with Adaptive Learning Rate for Rbf Neural Network

A modified Adam algorithm for deep neural network optimization

Effective Neural Network Training with a New Weighting Mechanism-Based Optimization Algorithm.

Accelerated Levenberg-Marquardt Algorithm for Radial Basis Function Neural Network

Stability Analysis of the Modified Levenberg–Marquardt Algorithm for the Artificial Neural Network Training

Fast Computational Approach to the Levenberg-Marquardt Algorithm for Training Feedforward Neural Networks

Revisiting Recursive Least Squares for Training Deep Neural Networks

Improving Steepest Descent Method by Learning Rate Annealing and Momentum in Neural Network

Adaptive Learning Rates with Maximum Variation Averaging.

A New Adaptive Levenberg–Marquardt Method for Nonlinear Equations and Its Convergence Rate under the Hölderian Local Error Bound Condition

Improving Deep Neural Networks' Training for Image Classification with Nonlinear Conjugate Gradient-Style Adaptive Momentum.

A Convergent ADMM Framework for Efficient Neural Network Training

A modification of adaptive moment estimation (adam) for machine learning

On a New Updating Rule of the Levenberg–Marquardt Parameter

Fuzzy optimization neural network model based on LM algorithm

Accelerating the Modified Levenberg-Marquardt Method for Nonlinear Equations.

ODE-LM: A Hybrid Training Algorithm for Feedforward Neural Networks

ABNGrad: adaptive step size gradient descent for optimizing neural networks

Efficient Adaptive Optimization via Subset-Norm and Subspace-Momentum: Fast, Memory-Reduced Training with Convergence Guarantees