Optimal Rates for the Last Iterate of the Stochastic subgradient Method under Heavy-Tails

Daniela Angela Parletta,Andrea Paudice,Saverio Salzo
2024-10-01
Abstract:In this paper, we provide novel optimal (or near optimal) convergence rates in expectation for the last iterate of a clipped version of the stochastic subgradient method. We consider nonsmooth convex problems, over possibly unbounded domains, under heavy-tailed noise that only possesses the first $p$ moments for $p \in (1,2]$. Our rates are of the order of $(\log k)/k^{(p-1)/p}$ and $1/k^{(p-1)/p}$ for infinite and finite horizon respectively. As a by-product, we also provide novel convergence rates for the average iterate, improving existing results by a $\log k$ factor. Preliminary experiments support our theory.
Optimization and Control
What problem does this paper attempt to address?