Abstract:This paper addresses the zero‐sum game problem for strict‐feedback nonlinear multiagent systems with full‐state constraints. Specifically, this paper focuses on the zero‐sum game scenario, wherein multiple agents aim to optimize the control strategies while considering the conflicting objectives of their opponents. To handle the full‐state constraints, a one‐to‐one nonlinear mapping technique is employed to convert the original strict‐feedback system into a more manageable pure‐feedback system without state constraints. In order to find a Nash equilibrium for virtual control signals and external disturbances, a simplified reinforcement learning algorithm is proposed, which tackles the challenges posed by solving the Hamilton–Jacobi–Isaacs equation. Unlike the existing H∞ optimal control strategies that deal with matching conditions, the H∞ optimal control strategy for strict‐feedback nonlinear systems needs to address the computational complexity issue arising from the repeated derivation of the virtual controller. To overcome the high‐order virtual controller problem, an approach based on the dynamic surface technique is introduced. By incorporating an approximation term of the high‐order virtual controller into the value function, the computational complexity challenge is effectively resolved. Based on the Lyapunov stability theorem, it is proved that all signals of the closed‐loop systems are semi‐global uniformly ultimately bounded and the tracking control performance can be guaranteed. Finally, simulation results are given to verify the effectiveness of the proposed control strategy.

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game

A novel Z-function-based completely model-free reinforcement learning method to finite-horizon zero-sum game of nonlinear system

Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method

Approximate N-Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System

Nonzero-Sum Games Using Actor-Critic Neural Networks: A Dynamic Event-Triggered Adaptive Dynamic Programming

Approximate-optimal control algorithm for constrained zero-sum differential games through event-triggering mechanism

Neural-network-based safe learning control for non-zero-sum differential games of nonlinear systems with asymmetric input constraints

Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks

Neural Q-learning for discrete-time nonlinear zero-sum games with adjustable convergence rate

Event-Triggered ADP for Nonzero-Sum Games of Unknown Nonlinear Systems

Adaptive Dynamic Programming for a Nonlinear Two‐Player Non‐Zero‐Sum Differential Game With State and Input Constraints

Online Reinforcement Learning-based Neural Network Controller Design for Affine Nonlinear Discrete-time Systems.

Near Optimal Neural Network-based Output Feedback Control of Affine Nonlinear Discrete-Time Systems

Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations

Min–max adaptive dynamic programming for zero-sum differential games

Zero‐sum game for nonlinear multiagent systems with full‐state constraints

Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms

Optimal Evolution Strategy for Continuous Strategy Games on Complex Networks via Reinforcement Learning

Generalized Nash Equilibrium Seeking for Noncooperative Game With Different Monotonicities by Adaptive Neurodynamic Algorithm