Abstract:This study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. A new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players and the simultaneous continuous‐time adaptive tuning laws for both critic and actor neural network weights are proposed. The algorithm is also enhanced to reduce the number of auxiliary NNs used in the critic. To reduce the learning time and space occupation, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players is proposed. By leveraging the neural network approximation and gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both critic and actor neural network weights. These laws facilitate the estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. The closed‐loop system stability and convergence of weights are guaranteed through the Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. The simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.

Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations

Online Synchronous Iterative Algorithm for Optimal Control of Stochastic Affine Nonlinear Systems

Online adaptive learning of optimal control solutions using integral reinforcement learning

Synchronous Optimal Control Method for Nonlinear Systems with Saturating Actuators and Unknown Dynamics Using Off-Policy Integral Reinforcement Learning

Event-triggered-based Online Integral Reinforcement Learning for Optimal Control of Unknown Constrained Nonlinear Systems.

Online Reinforcement Learning-based Neural Network Controller Design for Affine Nonlinear Discrete-time Systems.

Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems Using Online Approximators

Adaptive Optimal Control for a Class of Continuous-Time Affine Nonlinear Systems with Unknown Internal Dynamics

Online adaptive data-driven control for unknown nonlinear systems with constrained-input

Online Adaptive Optimal Control for Continuous-Time Nonlinear Systems with Completely Unknown Dynamics.

Near Optimal Neural Network-based Output Feedback Control of Affine Nonlinear Discrete-Time Systems

Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks

Generalized Policy Iteration-based Reinforcement Learning Algorithm for Optimal Control of Unknown Discrete-time Systems

Adaptive Optimal Control for Unknown Constrained Nonlinear Systems with a Novel Quasi-Model Network

Model‐free Adaptive Optimal Control of Continuous‐time Nonlinear Non‐zero‐sum Games Based on Reinforcement Learning

Reinforcement Learning for Adaptive Optimal Control of Unknown Continuous-Time Nonlinear Systems with Input Constraints.

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Online Adaptive Optimal Control Algorithm of Partial Unknown System with Adding Experience Replay and Safety Check

Online Adaptive Optimization Algorithm for Semi-Markov Control Processes

Neural-Network-Based Online Optimal Control for Uncertain Non-Linear Continuous-Time Systems with Control Constraints