Abstract: This article investigates the optimal distributed consensus control problem for discrete-time multiagent systems with completely unknown dynamics and differences in computational ability. The problem can be viewed as solving nonzero-sum games with distributed reinforcement learning (RL), in which each agent is a player. First, to guarantee the real-time performance of learning algorithms, a data-based distributed control algorithm is proposed for multiagent systems using offline system interaction data sets. By utilizing the interaction data produced while the real-time system runs, the proposed algorithm improves system performance based on distributed policy gradient RL. Convergence and stability are guaranteed based on functional analysis and the Lyapunov method. Second, to address asynchronous learning caused by differences in computational ability among agents, the proposed algorithm is extended to an asynchronous version in which whether each agent executes policy improvement is independent of its neighbors. Furthermore, an actor-critic structure, which contains two neural networks, is developed to implement the proposed algorithm in both the synchronous and asynchronous cases. Based on the method of weighted residuals, the convergence and optimality of the neural networks are guaranteed by proving that the approximation errors converge to zero. Finally, simulations are conducted to show the effectiveness of the proposed algorithm.

Data-Driven H∞ Output Consensus for Heterogeneous Multiagent Systems Under Switching Topology via Reinforcement Learning

Consensus Seeking in Multi-Agent Systems with an Active Leader and Communication Delays

Data-Efficient Off-Policy Learning for Distributed Optimal Tracking Control of HMAS with Unidentified Exosystem Dynamics

Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning

Cooperative Adaptive H∞ Output Regulation of Continuous-Time Heterogeneous Multi-Agent Markov Jump Systems

Output-Feedback-Based Adaptive Leaderless Consensus for Heterogenous Nonlinear Multiagent Systems With Switching Topologies

Distributed Event-Triggered Adaptive Control for Cooperative Output Regulation of Heterogeneous Multiagent Systems Under Switching Topology

Adaptive consensus for multi-agent systems with switched nonlinear dynamics and switching directed topologies

Data-driven output consensus for a class of discrete-time multiagent systems by reinforcement learning techniques

Dissipativity-Based Consensus Tracking of Singular Multiagent Systems With Switching Topologies and Communication Delays

Adaptive mean‐square consensus of heterogeneous multiagent systems with stochastic switching topologies

Reinforcement Learning-Based Unknown Reference Tracking Control of HMASs with Nonidentical Communication Delays

Consensus Control in Heterogeneous Nonlinear Multiagent Systems with Position Feedback and Switching Topologies

Adaptive Output Regulation of Heterogeneous Multiagent Systems Under Markovian Switching Topologies

Distributed H∞ Control of Multi-Agent Systems over Randomly Switching Topologies

Leader-Following Consensus Control for Uncertain Feedforward Stochastic Nonlinear Multiagent Systems

Data‐based distributed consensus optimal control for nonlinear multi‐agent systems under switching topology

Mean square leader‐following consensus of heterogeneous multi‐agent systems with Markovian switching topologies and communication delays

Leader-Following Consensus of Heterogeneous Linear Multiagent Systems With Communication Time-Delays via Adaptive Distributed Observers

Optimal consensus control for unknown second-order multi-agent systems: Using model-free reinforcement learning method

Optimal Tracking Control of Heterogeneous MASs Using Event-Driven Adaptive Observer and Reinforcement Learning