Abstract: To reduce learning time and memory usage, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral Hamilton-Jacobi (HJ) equation is proposed that can quickly and cooperatively determine the Nash equilibrium strategies of all players. By leveraging neural network (NN) approximation and the gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both the critic and actor NN weights. These laws allow the optimal value function and optimal policy to be estimated without requiring knowledge or identification of the system's dynamics. Closed‐loop system stability and convergence of the weights are guaranteed through Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. Simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.

Data-Driven Integral Reinforcement Learning for Continuous-Time Non-Zero-Sum Games

Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics

Integral Policy Iteration for Zero-Sum Games with Completely Unknown Nonlinear Dynamics

Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Data-Driven Nonzero-Sum Game for Discrete-Time Systems Using Off-Policy Reinforcement Learning

Off-policy Integral Reinforcement Learning Algorithm in Dealing with Nonzero Sum Game for Nonlinear Distributed Parameter Systems

Model‐free Adaptive Optimal Control of Continuous‐time Nonlinear Non‐zero‐sum Games Based on Reinforcement Learning

Model-Free Temporal Difference Learning For Non-Zero-Sum Games

Integral Reinforcement Learning Off-Policy Method for Solving Nonlinear Multi-Player Nonzero-Sum Games with Saturated Actuator

Learning Algorithms For Differential Games Of Continuous-Time Systems

Data-driven Adaptive Dynamic Programming Schemes for Non-Zero-sum Games of Unknown Discrete-Time Nonlinear Systems

Integral Reinforcement Learning-Based Optimal Control for Nonzero-Sum Games of Multi-Player Input-Constrained Nonlinear Systems

Integral Reinforcement Learning-Based Online Adaptive Event-Triggered Control for Non-Zero-sum Games of Partially Unknown Nonlinear Systems

Event-triggered Adaptive Integral Reinforcement Learning Method for Zero-Sum Differential Games of Nonlinear Systems with Incomplete Known Dynamics

Online reinforcement learning multiplayer non-zero sum games of continuous-time Markov jump linear systems

IRL Method for Time-Continuous Two-Player Nonzero Sum Game of Unknown System with Constrained-Input

Policy Iteration Based Q-learning for Linear Nonzero-Sum Quadratic Differential Games

Model-free Adaptive Dynamic Programming for Online Optimal Solution of the Unknown Nonlinear Zero-Sum Differential Game

Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics

Value Iteration Based Integral Reinforcement Learning Approach for H∞ Controller Design of Continuous-Time Nonlinear Systems