Abstract:Considering overshoot and chatter caused by the unknown interference, this article studies the adaptive robust optimal controls of continuous‐time (CT) multi‐input systems with an approximate dynamic programming (ADP) based Q‐function scheme. An adaptive integral reinforcement learning (IRL) scheme is proposed to study the optimal solutions of Q‐functions. First, multi‐input value functions are presented, and Nash equilibrium is analyzed. A complex Hamilton–Jacobi–Issacs (HJI) equation is constructed with the multi‐input system and the zero‐sum‐game‐based value function. It is a challenging task to solve the HJI equation for nonlinear system. Thus, A transformation of the HJI equation is constructed as a Q‐function. The neural network (NN) is applied to learn the solution of the transformed Q‐functions based on the adaptive IRL scheme. Moreover, an error information is added to the Q‐function for the issue of insufficient initial incentives to relax the persistent excitation (PE) condition. Simultaneously, an IRL signal of the critic networks is introduced to study the saddle‐point intractable solution, such that the system drift and NN derivatives in the HJI equation are relaxed. The convergence of weight parameters is proved, and the closed‐loop stability of the multi‐system with the proposed IRL Q‐function scheme is analyzed. Finally, a two‐engine driven F‐16 aircraft plant and a nonlinear system are presented to verify the effectiveness of the proposed adaptive IRL Q‐function scheme.

Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems with Disturbances

Synchronous Optimal Control Method for Nonlinear Systems with Saturating Actuators and Unknown Dynamics Using Off-Policy Integral Reinforcement Learning

Robust Optimal Control for a Class of Nonlinear Systems with Unknown Disturbances Based on Disturbance Observer and Policy Iteration.

Off-policy Neuro-Optimal Control for Unknown Complex-Valued Nonlinear Systems Based on Policy Iteration

Reinforcement Learning-Based Anti-disturbances Adaptive Control for Systems Subjected to Mismatched Disturbances and Input Uncertainties

Optimal Robust Control of Nonlinear Uncertain System Via Off-Policy Integral Reinforcement Learning

Reinforcement Learning-Based Control for Nonlinear Discrete-Time Systems with Unknown Control Directions and Control Constraints

Off-policy integral reinforcement learning optimal tracking control for continuous-time chaotic systems

Disturbance Observer Based Actor-Critic Learning Control for Uncertain Nonlinear Systems

Robust optimal control of the multi‐input systems with unknown disturbance based on adaptive integral reinforcement learning Q‐function

Online adaptive data-driven control for unknown nonlinear systems with constrained-input

Off-Policy Risk-Sensitive Reinforcement Learning-Based Constrained Robust Optimal Control

Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks

A learning optimal control scheme for robust stabilization of a class of uncertain nonlinear systems

Off‐policy reinforcement learning algorithm for robust optimal control of uncertain nonlinear systems

Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning

Event-triggered-based Online Integral Reinforcement Learning for Optimal Control of Unknown Constrained Nonlinear Systems.

Neural-Network-Based Online Optimal Control for Uncertain Non-Linear Continuous-Time Systems with Control Constraints

Indirect Adaptive Fuzzy-Regulated Optimal Control for Unknown Continuous-Time Nonlinear Systems.

Generalized Policy Iteration-based Reinforcement Learning Algorithm for Optimal Control of Unknown Discrete-time Systems

Data-Based Self-Learning Optimal Control For Continuous-Time Unknown Nonlinear Systems With Disturbance