Abstract: We consider distributed optimization problems in which a number of agents seek the global optimum of a sum of cost functions through only local information sharing. In this paper, we are particularly interested in scenarios where agents operate asynchronously over stochastic networks subject to random failures. Most existing algorithms require coordinated and decaying stepsizes to ensure zero gap between each agent's estimate and the exact optimum, which precludes asynchronous implementation and results in slower convergence. To deal with this issue, we develop a new asynchronous distributed gradient method (AsynDGM) based on consensus theory. The proposed algorithm not only allows for asynchronous implementation in a completely distributed manner but also, most importantly, is able to seek the exact optimum even with constant stepsizes. We show that the assumption of bounded gradients, which is widely used in the literature, can be dropped by instead imposing the standard Lipschitz continuity condition on gradients. Moreover, we derive an upper bound on the stepsize within which the proposed AsynDGM achieves a linear convergence rate for strongly convex functions with Lipschitz gradients. A canonical example of sensor fusion problems is provided to illustrate the effectiveness of the proposed algorithm.
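
The claim that a consensus-based gradient method can reach the exact optimum with a constant stepsize can be illustrated with a small sketch. The code below is not the AsynDGM algorithm from the abstract: it is a synchronous gradient-tracking iteration on a fixed ring network, with no random failures, chosen only because it is a well-known scheme with the same constant-stepsize property. The scalar sensor-fusion cost f_i(x) = 0.5 (z_i - h_i x)^2, the ring weights, the stepsize, and all function names are assumptions made for illustration.

```python
def ring_mix(v):
    """One consensus step on a ring: weight 1/2 on self, 1/4 on each neighbour."""
    n = len(v)
    return [0.5 * v[i] + 0.25 * v[i - 1] + 0.25 * v[(i + 1) % n] for i in range(n)]


def local_grad(h_i, z_i, x_i):
    """Gradient of the assumed local cost f_i(x) = 0.5 * (z_i - h_i * x)**2."""
    return h_i * (h_i * x_i - z_i)


def gradient_tracking(h, z, alpha=0.05, iters=1000):
    """Synchronous gradient tracking with a constant stepsize alpha.

    Each agent i keeps an estimate x[i] and a tracker y[i] of the
    network-average gradient. This is a simplified stand-in, not AsynDGM.
    """
    n = len(h)
    x = [0.0] * n
    g = [local_grad(h[i], z[i], x[i]) for i in range(n)]
    y = list(g)  # trackers start at the local gradients
    for _ in range(iters):
        # Mix estimates with neighbours, then step along the tracked gradient.
        x_next = [mx - alpha * yi for mx, yi in zip(ring_mix(x), y)]
        g_next = [local_grad(h[i], z[i], x_next[i]) for i in range(n)]
        # Mix trackers and add the change in the local gradient.
        y = [my + gn - go for my, gn, go in zip(ring_mix(y), g_next, g)]
        x, g = x_next, g_next
    return x


def centralized_optimum(h, z):
    """Closed-form minimizer of sum_i 0.5 * (z_i - h_i * x)**2."""
    return sum(hi * zi for hi, zi in zip(h, z)) / sum(hi * hi for hi in h)


if __name__ == "__main__":
    h = [1.0, 0.8, 1.2, 0.9, 1.1]    # hypothetical sensor gains
    z = [3.1, 2.2, 3.65, 2.7, 3.2]   # hypothetical noisy measurements
    print("agent estimates:", gradient_tracking(h, z))
    print("centralized optimum:", centralized_optimum(h, z))
```

In this sketch every agent's estimate should agree with the centralized least-squares solution up to numerical precision, even though the stepsize never decays. Handling asynchronous updates and random link failures, which is the focus of the abstract, would require the authors' actual method and analysis.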

Coordinate-Descent Diffusion Learning by Networked Agents

Dynamic Average Diffusion with Randomized Coordinate Updates

Diffusion Strategies Outperform Consensus Strategies for Distributed Estimation over Adaptive Networks

Convergence behavior of diffusion stochastic gradient descent algorithm

Augmented Distributed Gradient Methods for Multi-Agent Optimization under Uncoordinated Constant Stepsizes

Convergence in High Probability of Distributed Stochastic Gradient Descent Algorithms

Diffusion Adaptation over Networks with Kernel Least-Mean-Square

Maximum Total Correntropy Diffusion Adaptation over Networks with Noisy Links

Multitask Diffusion Adaptation over Networks

Improving the Transient Times for Distributed Stochastic Gradient Methods

Convergence of Asynchronous Distributed Gradient Methods over Stochastic Networks

CEDAS: A Compressed Decentralized Stochastic Gradient Method with Improved Convergence

A Coordinate Descent Primal-Dual Algorithm and Application to Distributed Asynchronous Optimization

Gradient Diversity: a Key Ingredient for Scalable Distributed Learning

On the Diffusion Approximation of Nonconvex Stochastic Gradient Descent

Distributed Analysis Dictionary Learning Using a Diffusion Strategy

An Accelerated Gossip-Based Distributed Gradient Method

Local AdaAlter: Communication-Efficient Stochastic Gradient Descent with Adaptive Learning Rates

Novel Convergence Results of Adaptive Stochastic Gradient Descents

Affine Combination of Diffusion Strategies over Networks

Exact Subspace Diffusion for Decentralized Multitask Learning