Abstract: In this paper, we consider a strongly convex finite-sum minimization problem over a decentralized network and propose a communication-efficient decentralized Newton's method for solving it. First, we apply dynamic average consensus (DAC) so that each node can use a local gradient approximation and a local Hessian approximation to track the global gradient and Hessian, respectively. Second, since exchanging entire local Hessian approximations is far from communication-efficient, we require the nodes to exchange compressed versions instead and apply an error compensation mechanism to correct for the compression noise, so that the global Hessian is still estimated in a communication-efficient way. Third, we introduce multi-step consensus for exchanging local variables and local gradient approximations to balance computation and communication. With a novel analysis, we establish a globally linear (resp., asymptotically super-linear) convergence rate for the proposed method when m is constant (resp., tends to infinity), where m is the number of consensus inner steps. To the best of our knowledge, this is the first super-linear convergence result for a communication-efficient decentralized Newton's method. Moreover, the rate we establish is provably faster than those of first-order methods. Our numerical results on various applications corroborate the theoretical findings.
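
To make the tracking idea concrete, the following is a minimal sketch of dynamic average consensus with m consensus rounds per iteration. It is a simplified illustration, not the method proposed in the paper. Everything specific here is an assumption made for the example: scalar quadratic local objectives f_i(x) = 0.5 * a_i * (x - b_i)^2, a ring network with uniform mixing weights, a damped step size `alpha`, and the helper names `ring_weights`, `mix`, and `run_tracking`. Compression and error compensation are omitted entirely.

```python
def ring_weights(n):
    """Symmetric, doubly stochastic mixing matrix for a ring of n >= 3 nodes
    (weight 1/3 on each node itself and on its two neighbors)."""
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in (i - 1, i, i + 1):
            W[i][j % n] += 1.0 / 3.0
    return W


def mix(W, values, m=1):
    """Apply m rounds of consensus averaging (multi-step consensus)."""
    n = len(values)
    for _ in range(m):
        values = [sum(W[i][j] * values[j] for j in range(n)) for i in range(n)]
    return values


def run_tracking(a, b, steps=200, m=3, alpha=0.3):
    """Newton-type iteration in which each node tracks the average gradient
    (g) and average Hessian (h) through dynamic average consensus.

    Hypothetical local objectives: f_i(x) = 0.5 * a[i] * (x - b[i])**2,
    so the local gradient is a[i] * (x - b[i]) and the local Hessian is a[i].
    """
    n = len(a)
    W = ring_weights(n)

    def grad(i, x):
        return a[i] * (x - b[i])

    x = [0.0] * n
    g = [grad(i, x[i]) for i in range(n)]  # trackers start at the local gradients
    h = list(a)                            # trackers start at the local Hessians

    for _ in range(steps):
        # Damped Newton-type step with the tracked quantities, then consensus.
        x_new = mix(W, [x[i] - alpha * g[i] / h[i] for i in range(n)], m)
        # DAC update: average the trackers, then add the change in the local gradient.
        g_mixed = mix(W, g, m)
        g = [g_mixed[i] + grad(i, x_new[i]) - grad(i, x[i]) for i in range(n)]
        # Local Hessians are constant for quadratics, so the DAC increment is zero.
        h = mix(W, h, m)
        x = x_new
    return x, g, h


if __name__ == "__main__":
    a = [1.0, 1.5, 2.0, 2.5, 3.0]
    b = [1.0, -2.0, 3.0, 0.5, 4.0]
    x, g, h = run_tracking(a, b)
    print("local iterates:", x)
    print("minimizer of the sum:", sum(ai * bi for ai, bi in zip(a, b)) / sum(a))
```

Because the mixing matrix is doubly stochastic, the sum of the trackers `g` stays equal to the sum of the local gradients at the current iterates in every iteration, which is the property that lets each node use its tracker in place of the global gradient. Raising `m` brings the nodes closer to agreement per iteration at the cost of more communication rounds.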

Distributed adaptive Newton methods with global superlinear convergence

Distributed Adaptive Newton Methods with Globally Superlinear Convergence

Achieving Globally Superlinear Convergence for Distributed Optimization with Adaptive Newton Method

Distributed Optimization Algorithm with Superlinear Convergence Rate

On Convergence of Distributed Approximate Newton Methods: Globalization, Sharper Bounds and Beyond

A Communication-Efficient Decentralized Newton's Method with Provably Faster Convergence

Distributed Inexact Newton Method with Adaptive Step Sizes

Distributed Stochastic Algorithm for Global Optimization in Networked System

Distributed finite-time optimization algorithms with a modified Newton–Raphson method

Accelerated Primal-Dual Algorithms for Distributed Smooth Convex Optimization over Networks

Distributed Adaptive Greedy Quasi-Newton Methods with Explicit Non-asymptotic Convergence Bounds

Distributed Algorithms for Composite Optimization: Unified Framework and Convergence Analysis

A Push-Pull Gradient Method for Distributed Optimization in Networks

Distributed Zeroth-Order Optimization: Convergence Rates That Match Centralized Counterpart

Convergence in High Probability of Distributed Stochastic Gradient Descent Algorithms

Sublinear and Linear Convergence of Modified ADMM for Distributed Nonconvex Optimization

Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes

Distributed Optimization With Personalization: A Flexible Algorithmic Framework

Q-linear Convergence of Distributed Optimization with Barzilai-Borwein Step Sizes

Distributed Nonconvex Optimization: Gradient-free Iterations and $ε$-Globally Optimal Solution