Abstract:In this paper, we discuss a novel algorithm for learning the solution to the optimal control problem (OCP) for affine nonlinear continuous-time constrained-input systems with completely unknown dynamics on a data-driven integral reinforcement learning (IRL) basis. It is well known that we have to obtain the solution of the nonlinear OCP by means of resolving the Hamilton-Jacobi-Bellman equation (HJBE). However, the HJBE is usually a nonlinear partial differential equation that cannot be solved analytically. To make matters worse, most practical systems are too complex to be accurately mathematically modelled and have real-time errors in the system’s controller. To address the above issues, we propose an online data-driven IRL algorithm that is anchored in policy iteration (PI), using real-time data from practical systems, rather than system models or partially sampled data from systems. To begin with, the PI algorithm is shown. Then, we approximate the performance function and the control policy using a critic neural network (CNN) and an actor neural network (ANN), respectively. The approach presented is an online-policy IRL, where the data are continuously sampled in the input and state domains. The weights of the CNN and ANN are renewed by least squares using the collected data, which minimizes residual errors. Finally, the validity of the approach in solving the OCP is demonstrated from the simulation results.

Synergetic Learning for Unknown Nonlinear H∞ Control Using Neural Networks.

Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees

H∞ Control with Constrained Input for Completely Unknown Nonlinear Systems Using Data-Driven Reinforcement Learning Method

Online reinforcement learning control of unknown nonaffine nonlinear discrete time systems

Learning from Adaptive Neural Control of SISO Strict-Feedback Nonlinear Systems

Online Reinforcement Learning-based Neural Network Controller Design for Affine Nonlinear Discrete-time Systems.

Reinforcement Learning-Based Control for Nonlinear Discrete-Time Systems with Unknown Control Directions and Control Constraints

Adaptive Neural Network Control of Nonlinear Systems with Unknown Dynamics

Reinforcement-Learning-Based Controller Design for Nonaffine Nonlinear Systems.

Adaptive H∞ Control for a Class of Non-Linear Systems Using Neural Networks

Fuzzy H∞ Control of Discrete-Time Nonlinear Markov Jump Systems via a Novel Hybrid Reinforcement Q-Learning Method

Off-policy reinforcement learning for H∞ control design.

Online adaptive data-driven control for unknown nonlinear systems with constrained-input

Online Adaptive Policy Learning Algorithm for H-Infinity State Feedback Control of Unknown Affine Nonlinear Discrete-Time Systems

Dynamic Learning from Adaptive Neural Network Control of a Class of Nonaffine Nonlinear Systems

Off-Policy Reinforcement Learning for $ H_\infty $ Control Design

Neural-network-based safe learning control for non-zero-sum differential games of nonlinear systems with asymmetric input constraints

Fuzzy $H_{\infty }$ Control of Discrete-Time Nonlinear Markov Jump Systems via a Novel Hybrid Reinforcement $Q$-Learning Method

Neural network-based finite-horizon optimal control of uncertain affine nonlinear discrete-time systems

Neuroadaptive learning algorithm for constrained nonlinear systems with disturbance rejection

H-Infinity Tracking of Unknown Nonlinear Systems Using Neural Network