Abstract: To reduce learning time and storage requirements, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral Hamilton‐Jacobi (HJ) equation is proposed that can quickly and cooperatively determine the Nash equilibrium strategies of all players. By leveraging neural network (NN) approximation and the gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both the critic and actor NN weights. These laws allow the optimal value function and optimal policy to be estimated without requiring knowledge or identification of the system's dynamics. Closed‐loop system stability and convergence of the weights are guaranteed through Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. Simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.

Model-free optimal tracking policies for Markov jump systems by solving non-zero-sum games

Fuzzy-model-based Tracking Control of Markov Jump Nonlinear Systems with Incomplete Mode Information

H∞ Optimal Output Tracking Control for Markov Jump Systems: A Reinforcement Learning‐based Approach

Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method

Finite-time L2−l∞ Tracking Control for Markov Jump Repeated Scalar Nonlinear Systems with Partly Usable Model Information

Safe tracking in games: Achieving optimal control with unknown dynamics and constraints

Model-Free Optimal Tracking Design With Evolving Control Strategies via Q-Learning

Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game

Asynchronous Event-triggered Control for Polynomial Fuzzy-Model-Based Markov Jump Systems with Complex Transition Probabilities

Adaptive Distributed Tracking Control for Markov Jump Multiagent Systems with a Non-Strict Leader

Dynamic event-triggered robust optimal tracking control for multi-player nonzero-sum games with mismatched uncertainties and asymmetric constrained inputs

Asynchronous Control for Discrete-Time Markovian Jump Systems with Multiplicative Noise

Asynchronous Event-Triggered Output-Feedback Control of Singular Markov Jump Systems

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Model Reduction of Markovian Jump Systems with Uncertain Probabilities

Advanced optimal tracking integrating a neural critic technique for asymmetric constrained zero-sum games

Multi‐event‐triggered adaptive dynamic programming for non‐zero‐sum game of unknown nonlinear system

Optimization of Markov Jump Linear System with Controlled Jump Probabilities of Modes

Robust Tracking and Model Following for Uncertain Markov Jump Linear Systems

Primal-Dual Reinforcement Learning for Zero-Sum Games in the Optimal Tracking Control