Abstract:This study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. A new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players and the simultaneous continuous‐time adaptive tuning laws for both critic and actor neural network weights are proposed. The algorithm is also enhanced to reduce the number of auxiliary NNs used in the critic. To reduce the learning time and space occupation, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players is proposed. By leveraging the neural network approximation and gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both critic and actor neural network weights. These laws facilitate the estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. The closed‐loop system stability and convergence of weights are guaranteed through the Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. The simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.

Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics

Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method

Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms

Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game

Event-Triggered ADP for Nonzero-Sum Games of Unknown Nonlinear Systems

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Approximate N-Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System

Reinforcement Learning-Based Control for Nonlinear Discrete-Time Systems with Unknown Control Directions and Control Constraints

Novel single-loop policy iteration for linear zero-sum games

A novel Z-function-based completely model-free reinforcement learning method to finite-horizon zero-sum game of nonlinear system

Min–max adaptive dynamic programming for zero-sum differential games

Robust policy iteration for continuous-time stochastic $H_\infty$ control problem with unknown dynamics

Inverse linear-quadratic nonzero-sum differential games

A Fixed-Point Policy-Iteration-Type Algorithm for Symmetric Nonzero-Sum Stochastic Impulse Control Games

Discrete-Time LQ Stochastic Two-Person Nonzero-Sum Difference Games with Random Coefficients:~Open-Loop Nash Equilibrium

A Policy Iteration Algorithm for N-player General-Sum Linear Quadratic Dynamic Games

Nash Equilibria for Linear Quadratic Discrete-time Dynamic Games via Iterative and Data-driven Algorithms

Relaxed Policy Iteration Algorithm for Nonlinear Zero-Sum Games with Application to H-infinity Control

Multidimensional indefinite stochastic Riccati equations and zero-sum linear-quadratic stochastic differential games with non-markovian regime switching

Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning

A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games