Abstract:In this paper, we consider a class of finite-sum convex optimization problems whose objective function is given by the average of m(≥1)\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$m\, ({\ge }1)$$\end{document} smooth components together with some other relatively simple terms. We first introduce a deterministic primal–dual gradient (PDG) method that can achieve the optimal black-box iteration complexity for solving these composite optimization problems using a primal–dual termination criterion. Our major contribution is to develop a randomized primal–dual gradient (RPDG) method, which needs to compute the gradient of only one randomly selected smooth component at each iteration, but can possibly achieve better complexity than PDG in terms of the total number of gradient evaluations. More specifically, we show that the total number of gradient evaluations performed by RPDG can be O(m)\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${{\mathcal {O}}} (\sqrt{m})$$\end{document} times smaller, both in expectation and with high probability, than those performed by deterministic optimal first-order methods under favorable situations. We also show that the complexity of the RPDG method is not improvable by developing a new lower complexity bound for a general class of randomized methods for solving large-scale finite-sum convex optimization problems.

General Optimal Step-size for ADMM-type Algorithms: Domain Parametrization and Optimal Rates

A Generic Closed-form Optimal Step-size for ADMM

On the dual step length of the alternating direction method of multipliers

Optimal parameter selection for the alternating direction method of multipliers (ADMM): quadratic problems

AGDA+: Proximal Alternating Gradient Descent Ascent Method With a Nonmonotone Adaptive Step-Size Search For Nonconvex Minimax Problems

Determining Step Sizes in Geometric Optimization Algorithms.

Two efficient gradient methods with approximately optimal stepsizes based on regularization models for unconstrained optimization

Towards Scalable Semidefinite Programming: Optimal Metric ADMM with A Worst-case Performance Guarantee

An optimal randomized incremental gradient method

Stochastic Primal–Dual Hybrid Gradient Algorithm with Adaptive Step Sizes

Optimizing ADMM and Over-Relaxed ADMM Parameters for Linear Quadratic Problems

Stochastic linearized generalized alternating direction method of multipliers: Expected convergence rates and large deviation properties

A Strengthened Conjecture on the Minimax Optimal Constant Stepsize for Gradient Descent

Monitoring the Convergence Speed of PDHG to Find Better Primal and Dual Step Sizes

Generalized Symmetric ADMM for Separable Convex Optimization

An Optimal ADMM for Unilateral Obstacle Problems

Understanding the ADMM Algorithm via High-Resolution Differential Equations

On the (linear) convergence of Generalized Newton Inexact ADMM

Convergence Revisit on Generalized Symmetric ADMM

A Bregman-Style Improved ADMM and its Linearized Version in the Nonconvex Setting: Convergence and Rate Analyses

A New Fast Adaptive Linearized Alternating Direction Multiplier Method for Convex Optimization