Abstract:This paper addresses the zero‐sum game problem for strict‐feedback nonlinear multiagent systems with full‐state constraints. Specifically, this paper focuses on the zero‐sum game scenario, wherein multiple agents aim to optimize the control strategies while considering the conflicting objectives of their opponents. To handle the full‐state constraints, a one‐to‐one nonlinear mapping technique is employed to convert the original strict‐feedback system into a more manageable pure‐feedback system without state constraints. In order to find a Nash equilibrium for virtual control signals and external disturbances, a simplified reinforcement learning algorithm is proposed, which tackles the challenges posed by solving the Hamilton–Jacobi–Isaacs equation. Unlike the existing H∞ optimal control strategies that deal with matching conditions, the H∞ optimal control strategy for strict‐feedback nonlinear systems needs to address the computational complexity issue arising from the repeated derivation of the virtual controller. To overcome the high‐order virtual controller problem, an approach based on the dynamic surface technique is introduced. By incorporating an approximation term of the high‐order virtual controller into the value function, the computational complexity challenge is effectively resolved. Based on the Lyapunov stability theorem, it is proved that all signals of the closed‐loop systems are semi‐global uniformly ultimately bounded and the tracking control performance can be guaranteed. Finally, simulation results are given to verify the effectiveness of the proposed control strategy.

H∞ Control for Discrete-time Linear Systems by Integrating Off-policy Q-learning and Zero-sum Game

H∞ Tracking Control for Linear Discrete-Time Systems: Model-Free Q-Learning Designs

Output-feedback Q-learning for discrete-time linear H-infinity tracking control: A Stackelberg game approach

H∞ Control of Discrete Switched Systems with Time Delay Via State Feedback

Interactions of salts and denaturing agents with a polyacrylamide gel.

H∞output Feedback Fault-Tolerant Control of Industrial Processes Based on Zero-Sum Games and Off-Policy Q-learning

Off-Policy Reinforcement Learning for $ H_\infty $ Control Design

Output‐feedback Q‐learning for discrete‐time linear <i>H</i><sup>∞</sup> tracking control: A Stackelberg game approach

Learning-Based Nonlinear $H^\infty$ Control via Game-Theoretic Differential Dynamic Programming

Reinforcement Learning for Finite-Horizon H∞ Tracking Control of Unknown Discrete Linear Time-Varying System

H∞$$ {h}_{\infty } $$ Optimal Output Tracking Control for Markov Jump Systems: A Reinforcement Learning‐based Approach

Reinforcement Learning-Based Control for Nonlinear Discrete-Time Systems with Unknown Control Directions and Control Constraints

Minimax Optimal Control of Uncertain Quasi-Integrable Hamiltonian Systems with Time-Delayed Bounded Feedback

Event-driven H ∞ control with critic learning for nonlinear systems

A Finite-Horizon Inverse Linear Quadratic Optimal Control Method for Human-in-the-Loop Behavior Learning

On the Global Optimality of Direct Policy Search for Nonsmooth $H_\infty$ Output-Feedback Control

Robust policy iteration for continuous-time stochastic $H_\infty$ control problem with unknown dynamics

Event-Driven H ∞ -Constrained Control Using Adaptive Critic Learning

Zero‐sum game for nonlinear multiagent systems with full‐state constraints

Q‐learning‐based H∞ control for LPV systems

Model-free $H_{\infty}$ control of Itô stochastic system via off-policy reinforcement learning