Sign Stochastic Gradient Descents Without Bounded Gradient Assumption for the Finite Sum Minimization

Tao Sun, Dongsheng Li
DOI: https://doi.org/10.1016/j.neunet.2022.02.012
IF: 7.8
2022-01-01
Neural Networks
Abstract: Sign-based Stochastic Gradient Descents (sign-based SGDs) use the signs of the stochastic gradients to reduce communication costs. Nevertheless, current convergence results for sign-based SGDs applied to finite sum optimization rely on the bounded gradient assumption, which fails to hold in various cases. This paper presents a convergence framework for sign-based SGDs that eliminates the bounded gradient assumption. Ergodic convergence rates are provided under only the smoothness assumption on the objective functions. The Sign Stochastic Gradient Descent (signSGD) and its two variants, the majority vote version and the zeroth-order version, are developed for different application settings. Our framework also removes the bounded gradient assumption used in the previous analysis of these three algorithms.
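To make the update rule concrete, here is a minimal sketch of signSGD with majority vote on a finite sum objective. It is an illustration, not the authors' implementation: the least-squares objective, the function names `sign_sgd_majority_vote` and `objective`, the constant step size, and the worker count are all assumptions chosen for the example, and the zeroth-order variant is not shown.

```python
import numpy as np

def sign_sgd_majority_vote(A, b, num_workers=4, lr=0.02, steps=1000, seed=0):
    """Sketch: minimize f(x) = (1/n) * sum_i 0.5 * (a_i^T x - b_i)^2.

    Hypothetical helper for illustration; the step size and objective
    are assumptions, not settings taken from the paper.
    """
    rng = np.random.default_rng(seed)
    n, d = A.shape
    x = np.zeros(d)
    for _ in range(steps):
        votes = np.zeros(d)
        for _ in range(num_workers):
            i = rng.integers(n)            # each worker samples one component f_i
            g = (A[i] @ x - b[i]) * A[i]   # stochastic gradient of f_i at x
            votes += np.sign(g)            # worker sends only the coordinate signs
        # The server moves against the majority sign; a tied vote gives
        # sign(0) = 0, so that coordinate is left unchanged this step.
        x -= lr * np.sign(votes)
    return x

def objective(A, b, x):
    """Finite sum least-squares objective used in this sketch."""
    return 0.5 * np.mean((A @ x - b) ** 2)
```

Each worker transmits one sign per coordinate instead of a full floating-point gradient, which is the source of the communication savings mentioned in the abstract. With `num_workers=1` the loop reduces to plain signSGD.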