Abstract:This study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. A new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players and the simultaneous continuous‐time adaptive tuning laws for both critic and actor neural network weights are proposed. The algorithm is also enhanced to reduce the number of auxiliary NNs used in the critic. To reduce the learning time and space occupation, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players is proposed. By leveraging the neural network approximation and gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both critic and actor neural network weights. These laws facilitate the estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. The closed‐loop system stability and convergence of weights are guaranteed through the Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. The simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.

Approximate Nash Solutions for Multiplayer Mixed-Zero-Sum Game with Reinforcement Learning

Data-Driven Nonzero-Sum Game for Discrete-Time Systems Using Off-Policy Reinforcement Learning

Online Optimal Solutions for Multi-Player Nonzero-Sum Game with Completely Unknown Dynamics

Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics

Reinforcement Learning Based Solution To Two-Player Zero-Sum Game Using Differentiator

Monte Carlo Neural Fictitious Self-Play: Achieve Approximate Nash equilibrium of Imperfect-Information Games.

Off-policy Synchronous Iteration IRL Method for Multi-Player Zero-Sum Games with Input Constraints

Online Synchronous Approximate Optimal Learning Algorithm for Multi-Player Non-Zero-Sum Games with Unknown Dynamics.

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Multiplayer Stackelberg-Nash Game for Nonlinear System via Value Iteration-Based Integral Reinforcement Learning

Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games.

Neural Auto-Curricula in Two-Player Zero-Sum Games.

Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game

Optimal Tracking Control for Multi-player Non-Zero-Sum Games of Continuous-Time Linear Systems with Unknown Dynamics.

Robust Optimal Control for Disturbed Nonlinear Zero-Sum Differential Games Based on Single NN and Least Squares

Nash Equilibrium in Iterated Multiplayer Games Under Asynchronous Best-Response Dynamics

Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games

Mix-zero-sum Differential Games for Linear Systems with Unknown Dynamics Based on Off-Policy IRL.

Online Dual-Network-Based Adaptive Dynamic Programming for Solving Partially Unknown Multi-Player Non-Zero-Sum Games with Control Constraints

Nash Equilibrium Seeking in Non-Zero-Sum Games: A Prescribed-Time Fuzzy Control Approach

Neural-network-based Synchronous Iteration Learning Method for Multi-Player Zero-Sum Games.