Abstract:We study the stability of accuracy during the training of deep neural networks (DNNs). In this context, the training of a DNN is performed via the minimization of a cross-entropy loss function, and the performance metric is accuracy (the proportion of objects that are classified correctly). While training results in a decrease of loss, the accuracy does not necessarily increase during the process and may sometimes even decrease. The goal of achieving stability of accuracy is to ensure that if accuracy is high at some initial time, it remains high throughout training. A recent result by Berlyand, Jabin, and Safsten introduces a doubling condition on the training data, which ensures the stability of accuracy during training for DNNs using the absolute value activation function. For training data in , this doubling condition is formulated using slabs in and depends on the choice of the slabs. The goal of this paper is twofold. First, to make the doubling condition uniform, that is, independent of the choice of slabs. This leads to sufficient conditions for stability in terms of training data only. In other words, for a training set T that satisfies the uniform doubling condition, there exists a family of DNNs such that a DNN from this family with high accuracy on the training set at some training time will have high accuracy for all time . Moreover, establishing uniformity is necessary for the numerical implementation of the doubling condition. We demonstrate how to numerically implement a simplified version of this uniform doubling condition on a dataset and apply it to achieve stability of accuracy using a few model examples. The second goal is to extend the original stability results from the absolute value activation function to a broader class of piecewise linear activation functions with finitely many critical points, such as the popular Leaky ReLU.

Do stable neural networks exist for classification problems? -- A new view on stability in AI

Stability for the training of deep neural networks and other classifiers

The Boundaries of Verifiable Accuracy, Robustness, and Generalisation in Deep Learning

Measuring and Mitigating Local Instability in Deep Neural Networks

Stable Analysis for Neural Networks: Set-valued Mapping Method

Prediction Stability: A New Metric for Quantitatively Evaluating DNN Outputs

Asymptotical Stability in Discrete-Time Neural Networks

Neural Processes with Stability

Forward Stability of ResNet and Its Variants

Complete Stability of Neural Networks with Nonmonotonic Piecewise Linear Activation Functions

A New Stability Criterion for Discrete-Time Neural Networks: Nonlinear Spectral Radius

Neural Network Optimal Feedback Control with Guaranteed Local Stability

Stability Analysis of Switched Linear Systems with Neural Lyapunov Functions

Robust stabilization of polytopic systems via fast and reliable neural network-based approximations

Stability and Convergence Analysis for a Class of Neural Networks

Stability of accuracy for the training of DNNs via the uniform doubling condition

On the Stability and Convergence of Physics Informed Neural Networks

Robust Stability of Neural Network-controlled Nonlinear Systems with Parametric Variability

New Necessary and Sufficient Conditions for Absolute Stability of Neural Networks

A Class of Connection Patterns for Neural Networks with Absolute Stability

Stable Attractors for Neural networks classification via Ordinary Differential Equations (SA-nODE)