Abstract:Primal-dual hybrid gradient method (PDHG, a.k.a. Chambolle and Pock method) is a well-studied algorithm for minimax optimization problems with a bilinear interaction term. Recently, PDHG is used as the base algorithm for a new LP solver PDLP that aims to solve large LP instances by taking advantage of modern computing resources, such as GPU and distributed system. Most of the previous convergence results of PDHG are either on duality gap or on distance to the optimal solution set, which are usually hard to compute during the solving process. In this paper, we propose a new progress metric for analyzing PDHG, which we dub infimal sub-differential size (IDS), by utilizing the geometry of PDHG iterates. IDS is a natural extension of the gradient norm of smooth problems to non-smooth problems, and it is tied with KKT error in the case of LP. Compared to traditional progress metrics for PDHG, IDS always has a finite value and can be computed only using information of the current solution. We show that IDS monotonically decays, and it has an $\mathcal O(\frac{1}{k})$ sublinear rate for solving convex-concave primal-dual problems, and it has a linear convergence rate if the problem further satisfies a regularity condition that is satisfied by applications such as linear programming, quadratic programming, TV-denoising model, etc. The simplicity of our analysis and the monotonic decay of IDS suggest that IDS is a natural progress metric to analyze PDHG. As a by-product of our analysis, we show that the primal-dual gap has $\mathcal O(\frac{1}{\sqrt{k}})$ convergence rate for the last iteration of PDHG for convex-concave problems. The analysis and results on PDHG can be directly generalized to other primal-dual algorithms, for example, proximal point method (PPM), alternating direction method of multipliers (ADMM) and linearized alternating direction method of multipliers (l-ADMM).

Near-optimal tensor methods for minimizing the gradient norm of convex functions and accelerated primal-dual tensor methods

Near-optimal tensor methods for minimizing the gradient norm of convex functions and accelerated primal–dual tensor methods

Implementable tensor methods in unconstrained convex optimization

Optimal and parameter-free gradient minimization methods for convex and nonconvex optimization

Accelerated Bregman Proximal Gradient Methods for Relatively Smooth Convex Optimization

Accelerated nonlinear primal-dual hybrid gradient methods with applications to supervised machine learning

An Entropy Regularization Technique for Minimizing a Sum of Tchebycheff Norms

Faster Accelerated First-order Methods for Convex Optimization with Strongly Convex Function Constraints

How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization

On the Infimal Sub-differential Size of Primal-Dual Hybrid Gradient Method and Beyond

A primal-dual hybrid gradient method for non-linear operators with applications to MRI

Fast Computation of Optimal Transport via Entropy-Regularized Extragradient Methods

Optimizing $(L_0, L_1)$-Smooth Functions by Gradient Methods

An Efficient Algorithm for the 𝓁p Norm Based Metric Nearness Problem

A Universal Accelerated Primal–Dual Method for Convex Optimization Problems

Two efficient gradient methods with approximately optimal stepsizes based on regularization models for unconstrained optimization

Convergence analysis of approximate primal solutions in dual first-order methods

The Complexity of Constrained Min-Max Optimization

On the Differentiability of the Primal-Dual Interior-Point Method

Primal Dual Alternating Proximal Gradient Algorithms for Nonsmooth Nonconvex Minimax Problems with Coupled Linear Constraints

Inertial Accelerated Primal-Dual Methods for Linear Equality Constrained Convex Optimization Problems.