Abstract:Distributed optimization enables networked agents to cooperatively solve a global optimization problem even with each participating agent only having access to a local partial view of the objective function. Despite making significant inroads, most existing results on distributed optimization rely on noise-free information sharing among the agents, which is problematic when communication channels are noisy, messages are coarsely quantized, or shared information are obscured by additive noise for the purpose of achieving differential privacy. The problem of information-sharing noise is particularly pronounced in the state-of-the-art gradient-tracking based distributed optimization algorithms, in that information-sharing noise will accumulate with iterations on the gradient-tracking estimate of these algorithms, and the ensuing variance will even grow unbounded when the noise is persistent. This paper proposes a new gradient-tracking based distributed optimization approach that can avoid information-sharing noise from accumulating in the gradient estimation. The approach is applicable even when the {inter-agent interaction is} time-varying, which is key to enable the incorporation of a decaying factor in inter-agent interaction to gradually eliminate the influence of information-sharing noise. In fact, we rigorously prove that the proposed approach can ensure the almost sure convergence of all agents to the same optimal solution even in the presence of persistent information-sharing noise. The approach is applicable to general directed graphs. It is also capable of ensuring the almost sure convergence of all agents to an optimal solution when the gradients are noisy, which is common in machine learning applications. Numerical simulations confirm the effectiveness of the proposed approach.

Asymptotic Network Independence in Distributed Stochastic Optimization for Machine Learning

The Asymptotic Distribution and Berry–Esseen Bound of a New Test for Independence in High Dimension with an Application to Stochastic Optimization

Distributed Stochastic Optimization With Unbounded Subgradients Over Randomly Time-Varying Networks

Distributed Stochastic Algorithm for Global Optimization in Networked System

Convergence in High Probability of Distributed Stochastic Gradient Descent Algorithms

On Data Dependence in Distributed Stochastic Optimization

Asymptotic Properties of Primal-Dual Algorithm for Distributed Stochastic Optimization Over Random Networks

A Unified Theory of Decentralized SGD with Changing Topology and Local Updates

Stochastic, Distributed and Federated Optimization for Machine Learning

Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks

Distributed Stochastic Optimization under a General Variance Condition

Gradient-tracking based Distributed Optimization with Guaranteed Optimality under Noisy Information Sharing

Fully asynchronous distributed optimization with linear convergence over directed networks

Data Dependent Convergence for Distributed Stochastic Optimization

Decentralized stochastic subgradient projection optimization algorithms over random networks

Non-Smooth Setting of Stochastic Decentralized Convex Optimization Problem Over Time-Varying Graphs

Stochastic Gradient Tracking Methods for Distributed Personalized Optimization over Networks

Distributed Stochastic Optimization with Random Communication and Computational Delays: Optimal Policies and Performance Analysis

Distributed Optimization With Personalization: A Flexible Algorithmic Framework

Asynchronous Adaptation and Learning over Networks --- Part I: Modeling and Stability Analysis

Achieving Globally Superlinear Convergence for Distributed Optimization with Adaptive Newton Method