Abstract:Decentralized stochastic optimization is the basic building block of modern collaborative machine learning, distributed estimation and control, and large-scale sensing. Since involved data usually contain sensitive information like user locations, healthcare records and financial transactions, privacy protection has become an increasingly pressing need in the implementation of decentralized stochastic optimization algorithms. In this paper, we propose a decentralized stochastic gradient descent algorithm which is embedded with inherent privacy protection for every participating agent against other participating agents and external eavesdroppers. This proposed algorithm builds in a dynamics based gradient-obfuscation mechanism to enable privacy protection without compromising optimization accuracy, which is in significant difference from differential-privacy based privacy solutions for decentralized optimization that have to trade optimization accuracy for privacy. The dynamics based privacy approach is encryption-free, and hence avoids incurring heavy communication or computation overhead, which is a common problem with encryption based privacy solutions for decentralized stochastic optimization. Besides rigorously characterizing the convergence performance of the proposed decentralized stochastic gradient descent algorithm under both convex objective functions and non-convex objective functions, we also provide rigorous information-theoretic analysis of its strength of privacy protection. Simulation results for a distributed estimation problem as well as numerical experiments for decentralized learning on a benchmark machine learning dataset confirm the effectiveness of the proposed approach.

Centralized Optimization for Dec-POMDPs under the Expected Average Reward Criterion

Decentralized Policy Optimization

Finding Optimal Observation-Based Policies for Constrained POMDPs under the Expected Average Reward Criterion

Finding Optimal Memoryless Policies of POMDPs under the Expected Average Reward Criterion

Model-Based Decentralized Policy Optimization

Monte-Carlo Search for an Equilibrium in Dec-POMDPs

Learning for Decentralized Control of Multiagent Systems in Large, Partially-Observable Stochastic Environments

Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach

Privacy-Preserving Push-Pull Method for Decentralized Optimization via State Decomposition

Observation-Based Optimization for POMDPs with Continuous State, Observation, and Action Spaces.

Decentralized Natural Policy Gradient with Variance Reduction for Collaborative Multi-Agent Reinforcement Learning

Scalable Model-based Policy Optimization for Decentralized Networked Systems

Gradient-Type Methods For Decentralized Optimization Problems With Polyak-Łojasiewicz Condition Over Time-Varying Networks

Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action

ADMM Based Privacy-preserving Decentralized Optimization

An Optimal Stochastic Algorithm for Decentralized Nonconvex Finite-sum Optimization

An Efficient Stochastic Algorithm for Decentralized Nonconvex-Strongly-Concave Minimax Optimization

Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions

Decentralized Stochastic Optimization with Inherent Privacy Protection

Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations