Multi-stage stochastic gradient method with momentum acceleration

Zhijian Luo,Siyu Chen,Yuntao Qian,Yueen Hou
DOI: https://doi.org/10.1016/j.sigpro.2021.108201
IF: 4.729
2021-01-01
Signal Processing
Abstract:Multi-stage optimization which invokes a stochastic algorithm restarting with the returned solution of previous stage, has been widely employed in stochastic optimization. Momentum acceleration technique is famously known for building gradient-based algorithms with fast convergence in large-scale optimiza-tion. In order to take the advantage of this acceleration in multi-stage stochastic optimization, we develop a multi-stage stochastic gradient descent with momentum acceleration method, named MAGNET, for first -order stochastic convex optimization. The main ingredient is the employment of a negative momentum, which extends the Nesterov's momentum to the multi-stage optimization. It can be incorporated in a stochastic gradient-based algorithm in multi-stage mechanism and provide acceleration. The proposed algorithm obtains an accelerated rate of convergence, and is adaptive and free from hyper-parameter tun -ing. The experimental results demonstrate that our algorithm is competitive with some state-of-the-art methods for solving several typical optimization problems in machine learning. (c) 2021 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?