Abstract: In this paper, we propose a novel differential-game-based neural network (NN) control architecture to solve an optimal control problem for a class of large-scale nonlinear systems involving N players. We aim to optimize the use of computational resources and the system performance simultaneously. In particular, the N players' control policies are designed to cooperatively optimize the large-scale system performance, and the sampling intervals for each player are designed to reduce the frequency of feedback execution. To develop a unified design framework that achieves both objectives, we formulate an optimal control problem that integrates both design requirements, which leads to a multi-player differential game. A solution to this problem is obtained numerically by solving the associated Hamilton-Jacobi (HJ) equation using event-driven approximate dynamic programming (E-ADP) and artificial NNs, online and forward in time. We employ critic neural networks to approximate the solution to the HJ equation, i.e., the optimal value function, with aperiodically available feedback information. Using the NN-approximated value function, we design the control policies and the sampling schemes. Finally, the event-driven N-player system is remodeled as a hybrid dynamical system with impulsive weight update rules to analyze its stability and convergence properties. The closed-loop practical stability of the system and the Zeno-free behavior of the sampling scheme are demonstrated using the Lyapunov method. Simulation results from a numerical example are included to substantiate the analytical results.
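
To illustrate the general idea of reducing the frequency of feedback execution through event-driven sampling, the following is a minimal sketch and not the architecture proposed in the paper. It assumes a single-player scalar linear system with a fixed feedback gain (no critic NN, no game), and a simple relative-threshold sampling rule; the function name `simulate_event_triggered` and all parameter values are hypothetical choices made for illustration only.

```python
def simulate_event_triggered(a=1.0, b=1.0, k=3.0, sigma=0.3,
                             x0=1.0, dt=1e-3, horizon=5.0):
    """Forward-Euler simulation of x' = a*x + b*u with u = -k*x_hat.

    x_hat is the most recently sampled state. It is refreshed only when
    the gap |x_hat - x| exceeds sigma*|x| (an assumed, illustrative
    triggering rule, not the one derived in the paper).
    Returns (final state, number of feedback updates, number of steps).
    """
    steps = int(round(horizon / dt))
    x = x0
    x_hat = x0      # the initial sample counts as the first event
    events = 1
    for _ in range(steps):
        # Event condition: sample only when the held state is stale enough.
        if abs(x_hat - x) > sigma * abs(x):
            x_hat = x
            events += 1
        u = -k * x_hat              # control uses the held sample
        x += dt * (a * x + b * u)   # Euler step of the plant
    return x, events, steps


if __name__ == "__main__":
    x_final, events, steps = simulate_event_triggered()
    print(f"final state: {x_final:.2e}, updates: {events} of {steps} steps")
```

With these assumed values the state still decays toward zero while the controller is updated at only a small fraction of the integration steps, which is the kind of trade-off between performance and feedback frequency the abstract refers to.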

Min–max adaptive dynamic programming for zero-sum differential games

Adaptive Dynamic Programming for a Nonlinear Two‐Player Non‐Zero‐Sum Differential Game With State and Input Constraints

Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method

Neural-network-based safe learning control for non-zero-sum differential games of nonlinear systems with asymmetric input constraints

Approximate-optimal control algorithm for constrained zero-sum differential games through event-triggering mechanism

Inverse linear-quadratic nonzero-sum differential games

Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms

Nonzero-Sum Games Using Actor-Critic Neural Networks: A Dynamic Event-Triggered Adaptive Dynamic Programming

Newton’s Method, Bellman Recursion and Differential Dynamic Programming for Unconstrained Nonlinear Dynamic Games

Approximate N-Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System

A kind of linear quadratic non-zero sum differential game of backward stochastic differential equation with asymmetric information

Event-Triggered Single-Network ADP for Zero-Sum Game of Unknown Nonlinear Systems with Constrained Input

Differential-game for resource aware approximate optimal control of large-scale nonlinear systems with multiple players

Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning

A zero-sum hybrid stochastic differential game with switching controls

Linear-Quadratic Non-Zero Sum Backward Stochastic Differential Game With Overlapping Information

Event-Triggered ADP for Nonzero-Sum Games of Unknown Nonlinear Systems

A zero-sum hybrid stochastic differential game with impulse controls

Two person non-zero-sum linear-quadratic differential game with Markovian jumps in infinite horizon