Abstract: In this study, a novel online adaptive dynamic programming (ADP)-based algorithm is developed for solving the optimal control problem of affine non-linear continuous-time systems with unknown internal dynamics. The algorithm employs an observer-critic architecture to approximate the solution of the Hamilton-Jacobi-Bellman equation. Two neural networks (NNs) are used in this architecture: an NN state observer is constructed to estimate the unknown system dynamics, and a critic NN is designed to derive the optimal control, replacing the action-critic dual networks typically employed in traditional ADP algorithms. Based on the developed architecture, the observer NN and the critic NN are tuned simultaneously. Unlike existing tuning laws for the critic, the newly developed critic update rule not only ensures convergence of the critic to the optimal control but also guarantees stability of the closed-loop system. No initial stabilising control is required, and by using recorded and instantaneous data simultaneously for the adaptation of the critic, the restrictive persistence of excitation condition is relaxed. In addition, Lyapunov's direct method is utilised to demonstrate the uniform ultimate boundedness of the weights of the observer NN and the critic NN. Finally, an example is provided to verify the effectiveness of the approach.

Error Bounds of Adaptive Dynamic Programming Algorithms for Solving Undiscounted Optimal Control Problems

Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration

Finite Horizon Optimal Control of Non-Linear Discrete-Time Switched Systems Using Adaptive Dynamic Programming with ε-Error Bound

Error bound analysis of policy iteration based approximate dynamic programming for deterministic discrete-time nonlinear systems

Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

Finite-approximation-error-based Discrete-Time Iterative Adaptive Dynamic Programming

Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems

An Adaptive Dynamic Programming Algorithm to Solve Optimal Control of Uncertain Nonlinear Systems

Nearly Finite-Horizon Optimal Control for a Class of Nonaffine Time-Delay Nonlinear Systems Based on Adaptive Dynamic Programming

Theoretical and Numerical Analysis of Approximate Dynamic Programming with Approximation Errors

Data-driven iterative adaptive dynamic programming algorithm for approximate optimal control of unknown nonlinear systems

Discrete-Time Optimal Control Via Local Policy Iteration Adaptive Dynamic Programming

Finite-Horizon ε-Optimal Tracking Control of Discrete-Time Linear Systems Using Iterative Approximate Dynamic Programming

Revisiting approximate dynamic programming and its convergence

Online Approximate Optimal Control for Affine Non-Linear Systems with Unknown Internal Dynamics Using Adaptive Dynamic Programming

A Novel Stable Value Iteration-Based Approximate Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems

Infinite Horizon Optimal Control of Affine Nonlinear Discrete Switched Systems Using Two-Stage Approximate Dynamic Programming

Adaptive Dynamic Programming for Nonlinear-Constrained H∞ Control

Finite Convergence of Value Iteration Algorithm for Discounted Infinite Horizon Optimal Control of Stochastic Logical Systems

Approximate Dynamic Programming for Nonlinear-Constrained Optimizations

An Approximate Neuro-Optimal Solution of Discounted Guaranteed Cost Control Design