Beyond Minimax Optimality: A Subgame Perfect Gradient Method

Benjamin Grimmer, Kevin Shu, Alex L. Wang
2024-12-10
Abstract: The study of unconstrained convex optimization has historically been concerned with worst-case a priori convergence rates. The development of the Optimized Gradient Method (OGM), due to Drori and Teboulle, Kim and Fessler, marked a major milestone in this study, as OGM achieves the optimal worst-case convergence rate among all gradient-span first-order methods. However, this notion of worst-case optimality is relatively coarse and allows OGM to have worst-case performance even on instances where stronger convergence guarantees are possible. For example, OGM is known to converge at its worst-case rate even on the toy example $Lx^2/2$, where exact convergence in just two steps is possible. We introduce a notion of optimality which is stronger than minimax optimality that requires a method to give optimal dynamic guarantees that exploit any "non-adversarialness" in the first-order oracle's reported information. We then give an algorithm which achieves this stronger optimality notion: the Subgame Perfect Gradient Method (SPGM). SPGM is a refinement of OGM whose update rules and convergence guarantees are dynamically computed in response to first-order information seen during the algorithm's execution. From a game-theoretic viewpoint, OGM can be seen as one side of a Nash Equilibrium for the "minimization game" whereas SPGM can be seen as one side of a Subgame Perfect Equilibrium for the same game. We also show that SPGM can be implemented with minimal computational and storage overhead in each iteration and provide a Julia implementation.
Optimization and Control
What problem does this paper attempt to address?
The paper "Beyond Minimax Optimality: A Subgame Perfect Gradient Method" addresses a limitation of worst-case convergence analysis in unconstrained convex optimization. It goes beyond traditional minimax optimality by introducing a stronger notion of optimality and proposing an algorithm that attains it, the Subgame Perfect Gradient Method (SPGM).

#### Background and Motivation

1. **Limitations of Traditional Methods**:
   - In unconstrained convex optimization, previous research has mainly focused on worst-case a priori convergence rates.
   - The Optimized Gradient Method (OGM), a momentum gradient method proposed by Drori and Teboulle [9] and Kim and Fessler [14], achieves the optimal worst-case convergence rate over the class of L-smooth convex functions among gradient-span first-order methods.
   - However, this notion of worst-case optimality is relatively coarse: OGM can exhibit its worst-case performance even on instances where stronger convergence guarantees are possible. For example, on the simple quadratic function \( f(x)=\frac{L}{2}x^{2} \), OGM still converges at its worst-case rate, even though exact convergence in just two steps is possible.
2. **Introducing a Stronger Concept of Optimality**:
   - The paper introduces a notion of optimality stronger than minimax optimality. It requires the algorithm to dynamically exploit any "non-adversarial" features in the first-order information it observes and to provide correspondingly better convergence guarantees.
   - From a game-theoretic perspective, OGM can be regarded as one side of a Nash equilibrium for the "minimization game", while SPGM is one side of a subgame perfect equilibrium for the same game and can achieve optimal convergence on both "easy instances" and "hard instances".

#### Main Contributions

1. **Proposing the SPGM Algorithm**:
   - SPGM is a refinement of OGM whose update rules and convergence guarantees are computed dynamically from the first-order information seen during execution.
   - SPGM can be implemented with minimal computational and storage overhead in each iteration, and a Julia implementation is provided.
2. **Theoretical Analysis**:
   - Within the game-theoretic framework, SPGM is proven to be a subgame perfect equilibrium strategy, so it can dynamically exploit useful information observed during the run.
   - Theorem 1 and Theorem 3 establish SPGM's optimal convergence guarantees in different settings, and these results are supported by experiments.
3. **Numerical Experiments**:
   - Finally, the paper compares SPGM with gradient descent, OGM, and Limited-Memory BFGS [18] in preliminary numerical experiments, reporting favorable performance for SPGM.

In summary, this paper addresses the limitations of worst-case optimality in existing optimization algorithms by introducing the Subgame Perfect Gradient Method, which offers a more flexible and efficient optimization method.
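
The quadratic toy example mentioned above can be checked numerically. The following is a minimal Python sketch, not the paper's Julia implementation and not SPGM itself. It assumes the standard OGM recursion of Kim and Fessler with a fixed horizon N and the usual guarantee \( f(x_N) - f^\star \le L\|x_0 - x^\star\|^2 / (2\theta_N^2) \). The helper names `ogm_thetas` and `ogm` and the values `L = 2`, `x0 = 3`, `N = 10` are illustrative assumptions.

```python
import math


def ogm_thetas(n_steps):
    """Return [theta_0, ..., theta_N] for the fixed-horizon OGM schedule."""
    thetas = [1.0]
    for i in range(n_steps):
        # The final coefficient uses 8 instead of 4 under the square root.
        factor = 8.0 if i == n_steps - 1 else 4.0
        thetas.append((1.0 + math.sqrt(1.0 + factor * thetas[-1] ** 2)) / 2.0)
    return thetas


def ogm(grad, x0, L, n_steps):
    """Run n_steps of OGM on a scalar problem; return (x_N, theta_N)."""
    thetas = ogm_thetas(n_steps)
    x, y = x0, x0
    for i in range(n_steps):
        y_next = x - grad(x) / L  # plain gradient step from x_i
        # Two momentum terms, both evaluated with the old x_i and y_i.
        x = (y_next
             + (thetas[i] - 1.0) / thetas[i + 1] * (y_next - y)
             + thetas[i] / thetas[i + 1] * (y_next - x))
        y = y_next
    return x, thetas[-1]


# Illustrative instance (assumed values): f(x) = L x^2 / 2 with minimizer 0.
L, x0, N = 2.0, 3.0, 10


def f(x):
    return 0.5 * L * x * x


def grad(x):
    return L * x


x_N, theta_N = ogm(grad, x0, L, N)
ogm_gap = f(x_N)                                   # f(x_N) - f*, since f* = 0
worst_case_bound = L * x0 ** 2 / (2.0 * theta_N ** 2)
gd_one_step = x0 - grad(x0) / L                    # one gradient step, size 1/L

print("OGM gap after N steps:", ogm_gap)
print("OGM worst-case bound: ", worst_case_bound)
print("One gradient step:    ", gd_one_step)
```

On this instance every gradient step lands on the minimizer, so each OGM iterate is pure momentum overshoot and \( |x_N| = |x_0| / \theta_N \). The final gap therefore matches the worst-case bound rather than beating it, while a single gradient step of size 1/L from `x0` already reaches the minimizer. This is the kind of "easy instance" that a method using the observed first-order information could exploit.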