Abstract: To reduce the learning time and space occupation, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral Hamilton-Jacobi (HJ) equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players is proposed. By leveraging neural network (NN) approximation and the gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both the critic and actor NN weights. These laws facilitate the estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. The closed‐loop system stability and the convergence of the weights are guaranteed through Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. The simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.

Integral Policy Iteration for Zero-Sum Games with Completely Unknown Nonlinear Dynamics

Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics

Online Synchronous Approximate Optimal Learning Algorithm for Multi-Player Non-Zero-Sum Games with Unknown Dynamics

Policy Iteration Based Q-learning for Linear Nonzero-Sum Quadratic Differential Games

Data-Driven Integral Reinforcement Learning for Continuous-Time Non-Zero-Sum Games

Adaptive Dynamic Programming for Solving Non-Zero-Sum Differential Games

Relaxed Policy Iteration Algorithm for Nonlinear Zero-Sum Games with Application to H-infinity Control

Model-free Adaptive Dynamic Programming for Online Optimal Solution of the Unknown Nonlinear Zero-Sum Differential Game

Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games

Policy-Iteration-Based Learning for Nonlinear Player Game Systems with Constrained Inputs

Off-policy Integral Reinforcement Learning Algorithm in Dealing with Nonzero Sum Game for Nonlinear Distributed Parameter Systems

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Adaptive Dynamic Programming for Two-Player Zero-Sum Differential Games with Completely Unknown Systems

Off-policy Synchronous Iteration IRL Method for Multi-Player Zero-Sum Games with Input Constraints

Online Finite-Horizon Optimal Learning Algorithm for Nonzero-Sum Games with Partially Unknown Dynamics and Constrained Inputs

Model‐free Adaptive Optimal Control of Continuous‐time Nonlinear Non‐zero‐sum Games Based on Reinforcement Learning

Neural-network-based Zero-Sum Game for Discrete-Time Nonlinear Systems Via Iterative Adaptive Dynamic Programming Algorithm

Learning Algorithms For Differential Games Of Continuous-Time Systems

Policy Gradient Adaptive Dynamic Programming for Nonlinear Discrete-Time Zero-Sum Games with Unknown Dynamics

Online Iterative Adaptive Dynamic Programming Approach for Solving the Zero-Sum Game for Nonlinear Continuous-Time Systems with Partially Unknown Dynamics

Mix-zero-sum Differential Games for Linear Systems with Unknown Dynamics Based on Off-Policy IRL