Abstract:In this study, a novel online adaptive dynamic programming (ADP)-based algorithm is developed for solving the optimal control problem of affine non-linear continuous-time systems with unknown internal dynamics. The present algorithm employs an observer-critic architecture to approximate the Hamilton-Jacobi-Bellman equation. Two neural networks (NNs) are used in this architecture: an NN state observer is constructed to estimate the unknown system dynamics and a critic NN is designed to derive the optimal control instead of typical action-critic dual networks employed in traditional ADP algorithms. Based on the developed architecture, the observer NN and the critic NN are tuned simultaneously. Meanwhile, unlike existing tuning laws for the critic, the newly developed critic update rule not only ensures convergence of the critic to the optimal control but also guarantees stability of the closed-loop system. No initial stabilising control is required, and by using recorded and instantaneous data simultaneously for the adaptation of the critic, the restrictive persistence of excitation condition is relaxed. In addition, Lyapunov direct method is utilised to demonstrate the uniform ultimate boundedness of the weights of the observer NN and the critic NN. Finally, an example is provided to verify the effectiveness of the present approach.

A Novel On-Line Vi-Adp For Nonlinear Discrete-Time Systems

Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems

Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

A novel adaptive dynamic programming based on tracking error for nonlinear discrete-time systems

A Novel Stable Value Iteration-Based Approximate Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems

Neural-network-based Optimal Control for Discrete-Time Nonlinear Systems Using General Value Iteration

Optimal control for discrete-time affine non-linear systems using general value iteration

Local Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems

Online Approximate Optimal Control for Affine Non-Linear Systems with Unknown Internal Dynamics Using Adaptive Dynamic Programming

Nearly Finite-Horizon Optimal Control for A Class of Nonaffine Time-Delay Nonlinear Systems Based on Adaptive Dynamic Programming

Finite-approximation-error-based Discrete-Time Iterative Adaptive Dynamic Programming.

Infinite-time stochastic linear quadratic optimal control for unknown discrete-time systems using adaptive dynamic programming approach

Nearly Finite-Horizon Optimal Control for Nonaffine Time-Delay Nonlinear Systems

Adaptive Optimal Control for a Class of Nonlinear Systems: the Online Policy Iteration Approach.

A Parallel Framework of Adaptive Dynamic Programming Algorithm with Off-Policy Learning.

Data-Based On-Line Optimal Control for Unknown Nonlinear Systems Via Adaptive Dynamic Programming Approach

Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems

Adaptive Dynamic Programming for a Class of Complex-Valued Nonlinear Systems

Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems

Online Adaptive Optimal Control for Continuous-Time Nonlinear Systems with Completely Unknown Dynamics.

Near-optimal Control for Continuous-Time Nonlinear Systems with Control Constraints Using On-Line ADP