Abstract:This paper addresses the zero‐sum game problem for strict‐feedback nonlinear multiagent systems with full‐state constraints. Specifically, this paper focuses on the zero‐sum game scenario, wherein multiple agents aim to optimize the control strategies while considering the conflicting objectives of their opponents. To handle the full‐state constraints, a one‐to‐one nonlinear mapping technique is employed to convert the original strict‐feedback system into a more manageable pure‐feedback system without state constraints. In order to find a Nash equilibrium for virtual control signals and external disturbances, a simplified reinforcement learning algorithm is proposed, which tackles the challenges posed by solving the Hamilton–Jacobi–Isaacs equation. Unlike the existing H∞ optimal control strategies that deal with matching conditions, the H∞ optimal control strategy for strict‐feedback nonlinear systems needs to address the computational complexity issue arising from the repeated derivation of the virtual controller. To overcome the high‐order virtual controller problem, an approach based on the dynamic surface technique is introduced. By incorporating an approximation term of the high‐order virtual controller into the value function, the computational complexity challenge is effectively resolved. Based on the Lyapunov stability theorem, it is proved that all signals of the closed‐loop systems are semi‐global uniformly ultimately bounded and the tracking control performance can be guaranteed. Finally, simulation results are given to verify the effectiveness of the proposed control strategy.

Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Online Synchronous Approximate Optimal Learning Algorithm for Multi-Player Non-Zero-Sum Games with Unknown Dynamics.

Near Optimal Neural Network-based Output Feedback Control of Affine Nonlinear Discrete-Time Systems

Policy-Iteration-Based Learning for Nonlinear Player Game Systems with Constrained Inputs.

Approximate-optimal control algorithm for constrained zero-sum differential games through event-triggering mechanism

Adaptive Optimal Output-Feedback Consensus Tracking Control of Nonlinear Multiagent Systems Using Two-Player Stackelberg Game

Learning Algorithms For Differential Games Of Continuous-Time Systems

Adaptive Dynamic Programming for Two-Player Zero-Sum Differential Games with Completely Unknown Systems

Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics

Advanced optimal tracking integrating a neural critic technique for asymmetric constrained zero-sum games

Neural-network-based safe learning control for non-zero-sum differential games of nonlinear systems with asymmetric input constraints

Learning Human Behavior in Shared Control: Adaptive Inverse Differential Game Approach

Reinforcement Learning for Inverse Non-Cooperative Linear-Quadratic Output-feedback Differential Games

Adaptive Dynamic Programming for Solving Non-Zero-Sum Differential Games.

Asymmetric Feedback Learning in Online Convex Games

Zero‐sum game for nonlinear multiagent systems with full‐state constraints

Min–max adaptive dynamic programming for zero-sum differential games

Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning

Output‐feedback Q‐learning for discrete‐time linear <i>H</i><sup>∞</sup> tracking control: A Stackelberg game approach