Abstract:This paper establishes an approximate optimal critic learning algorithm based on single neural network (NN) policy iteration (PI) aiming at solving for continuous-time (CT) 2-player zero-sum games (ZSGs). In fact, we have to face the problem that the errors will disturb the dynamics and in turn identifying dynamics will generate errors. In order to prevent the effect of errors, in this paper, a single NN-based online PI algorithm is developed for the CT system, which is disturbed nonlinear ZSG. With plenty of online data, the Hamilton-Jacobi-Isaacs equation can be solved without complete dynamics. Then by the least-squares method, we can obtain the NN weights. Moreover, in the process of dealing with the undisturbed system, we find the way that obtains NN weights in this paper is equal to the way that obtains the optimal solution by the Gauss-Newton method. Based on the convergence of the Gauss-Newton method, we can efficiently obtain the optimal controller for the undisturbed system by utilizing online data. After getting the controller of the undisturbed system, it is time to take disturbance into consideration, so that we design a robust control pair to overcome the disturbance. In order to demonstrate the effectiveness of this algorithm, we design a set of simulations. The results verify that we can solve the disturbed nonlinear ZSG by this algorithm.

Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems with Disturbance Using ADP

Data-Based Self-Learning Optimal Control For Continuous-Time Unknown Nonlinear Systems With Disturbance

Data-driven Optimal Control for a Class of Unknown Continuous-Time Nonlinear System Using a Novel ADP Method

Neural Network Optimal Control for Nonlinear System Based on Zero-Sum Differential Game.

Data-Based On-Line Optimal Control for Unknown Nonlinear Systems Via Adaptive Dynamic Programming Approach

Data-based Approximate Optimal Control for Nonzero-Sum Games of Multi-Player Systems Using Adaptive Dynamic Programming.

Adaptive Dynamic Programming-Based Optimal Control of Unknown Nonaffine Nonlinear Discrete-Time Systems with Proof of Convergence

Data-Driven Optimal Control for Multi-Player Non-Zero-Sum Games with Unknown Dynamics

Multi-objective optimal control for a class of unknown nonlinear systems based on finite-approximation-error ADP algorithm

Zero-Sum Game-Based Decentralized Optimal Control for Saturated Nonlinear Interconnected Systems via a Data and Event Driven Approach

Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network Adp

Event-triggered Adaptive Dynamic Programming for Multi-Player Zero-Sum Games with Unknown Dynamics

Multiobjective Optimal Control for a Class of Unknown Nonlinear Systems Based on Finite-Approximation-Error ADP Algorithm

Data-Driven Robust Approximate Optimal Tracking Control for Unknown General Nonlinear Systems Using Adaptive Dynamic Programming Method

An Iterative Adaptive Dynamic Programming Algorithm for Optimal Control of Unknown Discrete-Time Nonlinear Systems with Constrained Inputs.

Event-Driven Optimal Control For Uncertain Nonlinear Systems With External Disturbance Via Adaptive Dynamic Programming

Data-driven finite-horizon optimal tracking control scheme for completely unknown discrete-time nonlinear systems.

Event-Triggered Robust Optimal Control for Nonlinear Systems With Uncertain Dynamics and Nonzero-equilibrium

Neural-network-based Approach to Finite-Time Optimal Control for a Class of Unknown Nonlinear Systems

Adaptive Dynamic Programming for Robust Neural Control of Unknown Continuous-Time Non-Linear Systems

Robust Optimal Control for Disturbed Nonlinear Zero-Sum Differential Games Based on Single NN and Least Squares