Algorithms with Gradient Clipping for Stochastic Optimization with Heavy-Tailed Noise

M. Danilova
DOI: https://doi.org/10.1134/s1064562423701144
2024-03-12
Doklady Mathematics
Abstract: This article surveys the results of several studies [12–14, 26] that gradually addressed open questions in the high-probability convergence analysis of stochastic first-order optimization methods under mild assumptions on the noise. We first introduce the concept of gradient clipping, which plays a pivotal role in making stochastic methods work when the noise has a heavy-tailed distribution. Next, we examine why high-probability convergence guarantees are important and how they relate to in-expectation convergence guarantees. The concluding sections present the main results for minimization problems and the results of numerical experiments.
mathematics
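
For readers unfamiliar with the gradient clipping mentioned in the abstract, the following is a minimal illustrative sketch, not the article's own algorithm. It assumes the commonly used clipping operator clip(g, λ) = min(1, λ/‖g‖)·g and plugs it into a plain SGD loop. The function names, the step size, the clipping level, and the symmetric Pareto noise model (finite mean, infinite variance) are all hypothetical choices made for illustration.

```python
import math
import random


def clip(g, lam):
    """Rescale g so its Euclidean norm is at most lam (assumed operator:
    min(1, lam / ||g||) * g). Vectors already inside the ball are unchanged."""
    norm = math.sqrt(sum(x * x for x in g))
    if norm <= lam:
        return list(g)
    scale = lam / norm
    return [scale * x for x in g]


def clipped_sgd(grad_oracle, x0, step, lam, n_steps):
    """Plain SGD in which every stochastic gradient is clipped before the step.
    grad_oracle(x) is a hypothetical callable returning a noisy gradient at x."""
    x = list(x0)
    for _ in range(n_steps):
        g = clip(grad_oracle(x), lam)
        x = [xi - step * gi for xi, gi in zip(x, g)]
    return x


def heavy_tailed_oracle(x):
    """Gradient of f(x) = 0.5 * ||x||^2 plus symmetric Pareto noise.
    With shape 1.5 the noise has zero mean but infinite variance
    (an assumed toy noise model, not taken from the article)."""
    return [xi + random.choice([-1.0, 1.0]) * random.paretovariate(1.5)
            for xi in x]


if __name__ == "__main__":
    random.seed(0)
    x_final = clipped_sgd(heavy_tailed_oracle, [5.0, -5.0],
                          step=0.05, lam=2.0, n_steps=2000)
    print("final iterate:", x_final)
```

Because each clipped gradient has norm at most λ, a single extreme noise sample can move the iterate by at most step·λ, which is the intuition behind using clipping under heavy-tailed noise.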