Abstract:The optimistic gradient method has seen increasing popularity for solving convex-concave saddle point problems. To analyze its iteration complexity, a recent work [<a class="link-https" data-arxiv-id="1906.01115" href="https://arxiv.org/abs/1906.01115">arXiv:1906.01115</a>] proposed an interesting perspective that interprets this method as an approximation to the proximal point method. In this paper, we follow this approach and distill the underlying idea of optimism to propose a generalized optimistic method, which includes the optimistic gradient method as a special case. Our general framework can handle constrained saddle point problems with composite objective functions and can work with arbitrary norms using Bregman distances. Moreover, we develop a backtracking line search scheme to select the step sizes without knowledge of the smoothness coefficients. We instantiate our method with first-, second- and higher-order oracles and give best-known global iteration complexity bounds. For our first-order method, we show that the averaged iterates converge at a rate of $O(1/N)$ when the objective function is convex-concave, and it achieves linear convergence when the objective is strongly-convex-strongly-concave. For our second- and higher-order methods, under the additional assumption that the distance-generating function has Lipschitz gradient, we prove a complexity bound of $O(1/\epsilon^\frac{2}{p+1})$ in the convex-concave setting and a complexity bound of $O((L_pD^\frac{p-1}{2}/\mu)^\frac{2}{p+1}+\log\log\frac{1}{\epsilon})$ in the strongly-convex-strongly-concave setting, where $L_p$ ($p\geq 2$) is the Lipschitz constant of the $p$-th-order derivative, $\mu$ is the strong convexity parameter, and $D$ is the initial Bregman distance to the saddle point. Moreover, our line search scheme provably only requires a constant number of calls to a subproblem solver per iteration on average, making our first- and second-order methods particularly amenable to implementation.

Saddle point optimization with approximate minimization oracle

Saddle Point Optimization with Approximate Minimization Oracle and Its Application to Robust Berthing Control

A Decentralized Proximal Point-type Method for Saddle Point Problems

Gradient Methods with Dynamic Inexact Oracles

Generalized Optimistic Methods for Convex-Concave Saddle Point Problems

On the saddle point problem for non-convex optimization

Decomposable Non-Smooth Convex Optimization with Nearly-Linear Gradient Oracle Complexity

A Stochastic Proximal Point Algorithm for Saddle-Point Problems

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

Riemannian stochastic optimization methods avoid strict saddle points

Acceleration with a Ball Optimization Oracle

Proximal Oracles for Optimization and Sampling

Simple and Optimal Stochastic Gradient Methods for Nonsmooth Nonconvex Optimization

Partial-Quasi-Newton Methods

On Randomized Fictitious Play for Approximating Saddle Points Over Convex Sets

An Infeasible-Point Subgradient Method Using Adaptive Approximate Projections

Partial-Quasi-Newton Methods: Efficient Algorithms for Minimax Optimization Problems with Unbalanced Dimensionality

Universality of AdaGrad Stepsizes for Stochastic Optimization: Inexact Oracle, Acceleration and Variance Reduction

Optimal inexactness schedules for tunable oracle-based methods

Proximal Point Algorithms for Nonsmooth Convex Optimization with Fixed Point Constraints

Optimal inexactness schedules for Tunable Oracle based Methods