Abstract: This paper studies the cooperative tracking control problem of interacting multi‐agent systems (MASs) under undirected communication. Based on differential graphical game theory, the MAS tracking control problem is formulated as an infinite‐horizon cooperative differential graphical game, in which a multi‐objective optimization problem is designed and then cast into a Pareto‐equivalent single‐objective optimization problem using a scalarization method. Necessary and sufficient conditions for the existence of a Pareto‐optimal strategy for the game‐theoretic tracking control problem are established, and it is proven that the solution to the integral Bellman optimality equation yields a Pareto‐optimal strategy. An off‐policy integral reinforcement learning (IRL) scheme is then developed to find the optimal control strategy in a purely data‐driven manner, which requires less computational effort than the traditional learning scheme. Simulation results validate the effectiveness of the proposed game and IRL‐based tracking control method.

Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games

Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method

Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game

A novel Z-function-based completely model-free reinforcement learning method to finite-horizon zero-sum game of nonlinear system

Multiplayer Stackelberg-Nash Game for Nonlinear System via Value Iteration-Based Integral Reinforcement Learning

Approximate N-Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System

Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning

A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games

Robust policy iteration for continuous-time stochastic $H_\infty$ control problem with unknown dynamics

Reinforcement Learning In Two Player Zero Sum Simultaneous Action Games

Safe tracking in games: Achieving optimal control with unknown dynamics and constraints

Learning to Play General-Sum Games against Multiple Boundedly Rational Agents

Novel single-loop policy iteration for linear zero-sum games

Event-Triggered ADP for Nonzero-Sum Games of Unknown Nonlinear Systems

Differential graphical game‐based multi‐agent tracking control using integral reinforcement learning

Relaxed Policy Iteration Algorithm for Nonlinear Zero-Sum Games with Application to H-infinity Control

A Policy Iteration Algorithm for N-player General-Sum Linear Quadratic Dynamic Games

FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game

Score-Based Equilibrium Learning in Multi-Player Finite Games with Imperfect Information