Super-Universal Regularized Newton Method
Nikita Doikov,Konstantin Mishchenko,Yurii Nesterov
DOI: https://doi.org/10.1137/22m1519444
IF: 2.763
2024-01-05
SIAM Journal on Optimization
Abstract:SIAM Journal on Optimization, Volume 34, Issue 1, Page 27-56, March 2024. We analyze the performance of a variant of the Newton method with quadratic regularization for solving composite convex minimization problems. At each step of our method, we choose a regularization parameter proportional to a certain power of the gradient norm at the current point. We introduce a family of problem classes characterized by the Hölder continuity of either the second or third derivative. Then we present the method with a simple adaptive search procedure allowing an automatic adjustment to the problem class with the best global complexity bounds, without knowing specific parameters of the problem. In particular, for the class of functions with a Lipschitz continuous third derivative, we get the global [math] rate, which was previously attributed to third-order tensor methods. When the objective function is uniformly convex, we justify an automatic acceleration of our scheme, resulting in a faster global rate and local superlinear convergence. The switching between the different rates (sublinear, linear, and superlinear) is automatic. Again, for that, no a priori knowledge of parameters is needed.
mathematics, applied