Abstract: To reduce learning time and storage space, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral Hamilton–Jacobi (HJ) equation is proposed that can quickly and cooperatively determine the Nash equilibrium strategies of all players. By leveraging neural network (NN) approximation and the gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both the critic and actor NN weights. These laws allow the optimal value function and optimal policy to be estimated without requiring knowledge or identification of the system's dynamics. Closed‐loop stability and convergence of the weights are guaranteed through Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. Simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.
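The critic-tuning idea mentioned in the abstract can be illustrated with a deliberately simplified sketch. It is not the paper's algorithm: it assumes a single player, a hypothetical scalar linear plant, a fixed policy, a one-term critic V(x) ≈ w·x², and a discretized, normalized gradient step in place of the continuous-time tuning law. All constants and function names below are illustrative assumptions. The plant constants are used only to generate trajectory data; the weight update itself reads only the measured states and the integrated cost.

```python
import math

# Hypothetical scalar plant x' = A*x + B*u under a fixed policy u = -K*x.
# These constants only generate data; the critic update never reads them.
A, B, K = -1.0, 1.0, 0.5
Q, R = 1.0, 1.0   # running cost r = Q*x^2 + R*u^2 (assumed quadratic)
T = 0.1           # reinforcement interval


def sample(x0):
    """Return x(t+T) and the running cost integrated over [t, t+T]."""
    ac = A - B * K                      # closed-loop pole
    x1 = x0 * math.exp(ac * T)          # exact closed-loop solution
    c = Q + R * K * K                   # cost per unit x^2 under u = -K*x
    integral = c * x0 * x0 * (math.exp(2 * ac * T) - 1.0) / (2 * ac)
    return x1, integral


def train_critic(steps=2000, alpha=1.0):
    """Gradient descent on the squared integral Bellman residual."""
    w = 0.0   # critic weight, V(x) ~ w * x^2
    x = 2.0
    for _ in range(steps):
        x1, integral = sample(x)
        dphi = x1 * x1 - x * x          # phi(x(t+T)) - phi(x(t)), phi = x^2
        e = w * dphi + integral         # integral Bellman residual
        # Normalized gradient step (the normalization is an assumed choice).
        w -= alpha * e * dphi / (1.0 + dphi * dphi) ** 2
        # Restart the trajectory once it decays, to keep the data informative.
        x = x1 if abs(x1) > 0.5 else 2.0
    return w


if __name__ == "__main__":
    print(train_critic())
```

For this toy setup the learned weight can be checked against the closed-form value (Q + R·K²) / (−2·(A − B·K)), which follows from the value function of a stable scalar linear system under a quadratic cost. The multi-player, actor–critic, and exploration aspects described in the abstract are outside the scope of this sketch.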

A Novel Actor–critic–identifier Architecture for Nonlinear Multiagent Systems with Gradient Descent Method

Multi-Agent Reinforcement Learning Control for Consensus Problems of Uncertain Nonlinear Multi-Agent Systems

Online Reinforcement Learning-based Neural Network Controller Design for Affine Nonlinear Discrete-time Systems

Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

Near Optimal Neural Network-based Output Feedback Control of Affine Nonlinear Discrete-Time Systems

Online Reinforcement Learning Neural Network Controller Design for Nanomanipulation

Optimal Leader-Following Consensus Control of Multi-Agent Systems: A Neural Network Based Graphical Game Approach

Optimized leader-follower consensus control for high-order nonlinear multi-agent system modeled in canonical dynamic form

Online optimal consensus control of unknown linear multi-agent systems via time-based adaptive dynamic programming

Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks

Optimized backstepping consensus control using adaptive observer-critic-actor reinforcement learning for strict-feedback multi-agent systems

Reinforcement learning-based consensus control for MASs with intermittent constraints

Observer-Based Adaptive Neural Inverse Optimal Consensus Control of Nonlinear Multiagent Systems

Neural‐network‐based adaptive leader‐following consensus control for second‐order non‐linear multi‐agent systems

Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning

Adaptive NN Control for Nonlinear Multi-Agent Systems With Unknown Control Direction and Full State Constraints

Intermittent Event-Triggered Optimal Leader-Following Consensus for Nonlinear Multi-Agent Systems via Actor-Critic Algorithm

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Dynamic Event-Driven Finite-Horizon Optimal Consensus Control for Constrained Multiagent Systems

Adaptive Reinforcement Learning for Fault-Tolerant Optimal Consensus Control of Nonlinear Canonical Multiagent Systems With Actuator Loss of Effectiveness

Optimal consensus control for unknown second-order multi-agent systems: Using model-free reinforcement learning method