Provable accelerated gradient method for nonconvex low rank optimization

Huan Li,Zhouchen Lin
DOI: https://doi.org/10.1007/s10994-019-05819-w
IF: 5.414
2019-01-01
Machine Learning
Abstract:Optimization over low rank matrices has broad applications in machine learning. For large-scale problems, an attractive heuristic is to factorize the low rank matrix to a product of two much smaller matrices. In this paper, we study the nonconvex problem min _𝐔∈ℝ^n× r g(𝐔)=f(𝐔𝐔^T) under the assumptions that f(𝐗) is restricted μ -strongly convex and L -smooth on the set {𝐗:𝐗≽ 0, rank (𝐗)≤ r} . We propose an accelerated gradient method with alternating constraint that operates directly on the 𝐔 factors and show that the method has local linear convergence rate with the optimal dependence on the condition number of √(L/μ) . Globally, our method converges to the critical point with zero gradient from any initializer. Our method also applies to the problem with the asymmetric factorization of 𝐗=𝐔𝐕^T and the same convergence result can be obtained. Extensive experimental results verify the advantage of our method.
What problem does this paper attempt to address?