Abstract:This paper studies the cooperative tracking control problem of interacted multi‐agent systems under undirected communication. An infinite horizon cooperative differential graphical game‐theoretic tracking control framework along with a data‐driven off‐policy integral reinforcement learning scheme is proposed, where several theoretical results are rigorously established. Simulated results are conducted to validate the effectiveness of the proposed game and IRL‐based tracking control method. This paper studies the cooperative tracking control problem of interacted multi‐agent systems (MASs) under undirected communication. Based on differential graphical game theory, the MAS tracking control problem is formulated as an infinite horizon cooperative differential graphical game‐theoretic tracking control framework, where a multi‐objective optimization problem is designed and then cast into a Pareto‐equivalent single‐objective optimization problem using a scalarization method. Necessary and sufficient conditions for the existence of the Pareto‐optimal strategy to the game theoretic tracking control are established, where it has been proven that the solution to the integral Bellman optimality equation leads to Pareto‐optimal strategy. Then, an off‐policy integral reinforcement learning scheme to find optimal control strategy using a pure data‐driven manner is developed, which consumes less computation efforts than the traditional learning scheme. Simulated results are conducted to validate the effectiveness of the proposed game and IRL‐based tracking control method.

Data-Driven Inverse Cooperative Game Control Via Off-Policy Q-Learning

Cooperative Path Following Control in Autonomous Vehicles Graphical Games: A Data-Based Off-Policy Learning Approach

Learning Human Behavior in Shared Control: Adaptive Inverse Differential Game Approach

Inverse optimal stabilization of cooperative control in networked multi-agent systems

Differential graphical game‐based multi‐agent tracking control using integral reinforcement learning

Reinforcement Learning for Inverse Non-Cooperative Linear-Quadratic Output-feedback Differential Games

A Combined Policy Gradient and Q-learning Method for Data-driven Optimal Control Problems

Human-in-the-loop Distributed Cooperative Tracking Control with Applications to Autonomous Ground Vehicles: A Data-Driven Mixed Iteration Approach

Data-Efficient Off-Policy Learning for Distributed Optimal Tracking Control of HMAS with Unidentified Exosystem Dynamics.

A Data-Driven Approach for Inverse Optimal Control

Value Iteration-Based Cooperative Adaptive Optimal Control for Multi-Player Differential Games With Incomplete Information

Indirect Shared Control Through Non-Zero Sum Differential Game for Cooperative Automated Driving

Inverse linear quadratic dynamic games using partial state observations

Data-driven cooperative optimal output regulation for linear discrete-time multi-agent systems by online distributed adaptive internal model approach

Inverse reinforcement learning methods for linear differential games

Gradient-based Cooperative Control of quasi-Linear Parameter Varying Vehicles with Noisy Gradients

3DIOC: Direct Data-Driven Inverse Optimal Control for LTI Systems

Reinforcement Learning for Inverse Linear-quadratic Dynamic Non-cooperative Games

Control of Vehicle Platoons with Collision Avoidance Using Noncooperative Differential Games

Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems

Inverse linear-quadratic nonzero-sum differential games