Abstract:This study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. A new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players and the simultaneous continuous‐time adaptive tuning laws for both critic and actor neural network weights are proposed. The algorithm is also enhanced to reduce the number of auxiliary NNs used in the critic. To reduce the learning time and space occupation, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players is proposed. By leveraging the neural network approximation and gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both critic and actor neural network weights. These laws facilitate the estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. The closed‐loop system stability and convergence of weights are guaranteed through the Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. The simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.

Optimal Tracking Control for Non-Zero-sum Games of Linear Discrete-Time Systems Via Off-Policy Reinforcement Learning

Optimal Tracking Control for Multi-player Non-Zero-Sum Games of Continuous-Time Linear Systems with Unknown Dynamics.

A Learning-Based Optimal Tracking Controller for Continuous Linear Systems with Unknown Dynamics: Theory and Case Study

Robust Optimal Tracking Control for Multiplayer Systems by Off‐policy Q‐learning Approach

Primal-Dual Reinforcement Learning for Zero-Sum Games in the Optimal Tracking Control

Data-Efficient Off-Policy Learning for Distributed Optimal Tracking Control of HMAS with Unidentified Exosystem Dynamics.

Off-policy Based Adaptive Dynamic Programming Method for Nonzero-Sum Games on Discrete-Time System

Model‐free Adaptive Optimal Control of Continuous‐time Nonlinear Non‐zero‐sum Games Based on Reinforcement Learning

Robust Optimal Tracking Control for Linear Systems via Adaptive Dynamic Programming method

Robust Tracking Control and Output Regulation

Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning

Off-policy reinforcement learning for tracking control of discrete-time Markov jump linear systems with completely unknown dynamics.

Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method

Data-driven Optimal Tracking Control for a Class of Affine Non-Linear Continuous-Time Systems with Completely Unknown Dynamics

Nearly Optimal Control for Mixed Zero-Sum Game Based on Off-Policy Integral Reinforcement Learning

Adaptive Optimal Tracking Controls of Unknown Multi-Input Systems Based on Nonzero-Sum Game Theory

Nash Tracking Controls Of Multi-Input Nonzero-Sum Game System With Reinforcement Learning

Safe tracking in games: Achieving optimal control with unknown dynamics and constraints

Data-driven Approximate Optimal Tracking Control Schemes for Unknown Non-Affine Non-Linear Multi-Player Systems Via Adaptive Dynamic Programming

H∞ Tracking Control for Linear Discrete‐time Systems Via Reinforcement Learning

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration