Stochastic Differential Equations for Modeling First Order Optimization Methods

M. Dambrine, Ch. Dossal, B. Puig, A. Rondepierre
DOI: https://doi.org/10.1137/21m1435665
IF: 2.763
2024-04-12
SIAM Journal on Optimization
Abstract: SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1402-1426, June 2024. In this article, a family of SDEs is derived as a tool to understand the behavior of numerical optimization methods under random evaluations of the gradient. Our objective is to transpose to the stochastic setting the use of continuous versions, obtained through ODEs, for understanding the asymptotic behavior of a discrete optimization scheme. We consider a continuous version of the stochastic gradient scheme and of a stochastic inertial system. This article first studies the quality of the approximation of the discrete scheme by an SDE when the step size tends to 0. Then, it presents new asymptotic bounds on the values [math], where [math] is a solution of the SDE and [math], when [math] is convex and under integrability conditions on the noise. Results are provided under two sets of hypotheses: first considering [math] and convex functions and then adding some geometrical properties of [math]. All of these results provide insight into the behavior of these inertial and perturbed algorithms in the setting of stochastic algorithms.
mathematics, applied
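
The abstract describes approximating a discrete stochastic gradient scheme by an SDE as the step size tends to 0. The paper's own equations are not reproduced in this record, so the following is only a minimal sketch under stated assumptions: it takes the simplest gradient-flow SDE, dX_t = -grad f(X_t) dt + sigma dW_t, with a constant noise level `sigma` and the illustrative convex objective f(x) = 0.5 * x**2, and discretizes it with the Euler-Maruyama scheme. The function names, the objective, and the noise model are hypothetical choices made for illustration and are not taken from the article.

```python
import math
import random


def grad_f(x):
    # Gradient of the illustrative convex objective f(x) = 0.5 * x**2
    # (an assumption; the article treats general convex f).
    return x


def euler_maruyama(x0, step, sigma, n_steps, seed=0):
    """Simulate dX_t = -grad_f(X_t) dt + sigma dW_t with time step `step`."""
    rng = random.Random(seed)
    x = x0
    path = [x]
    for _ in range(n_steps):
        # Brownian increment over an interval of length `step`:
        # Gaussian with mean 0 and standard deviation sqrt(step).
        dw = rng.gauss(0.0, math.sqrt(step))
        x = x - step * grad_f(x) + sigma * dw
        path.append(x)
    return path


def mean_final_value(x0, step, sigma, n_steps, n_paths):
    """Monte Carlo estimate of E[f(X_T)] at T = n_steps * step."""
    total = 0.0
    for seed in range(n_paths):
        x_final = euler_maruyama(x0, step, sigma, n_steps, seed=seed)[-1]
        total += 0.5 * x_final ** 2
    return total / n_paths
```

With `sigma = 0` the recursion reduces to plain gradient descent on f, which gives a quick sanity check. With `sigma > 0`, averaging `mean_final_value` over many paths gives an empirical view of the kind of expected-value quantity that the abstract's asymptotic bounds concern, for this toy setting only.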