Abstract: Adaptive dynamic programming (ADP) based approaches are effective for solving nonlinear Hamilton–Jacobi–Bellman (HJB) equations in an approximate sense. This paper develops a novel ADP-based approach that focuses on minimizing the consecutive changes in control inputs over a finite horizon to solve the optimal tracking problem for completely unknown discrete-time systems. To that end, the cost function includes three terms: tracking performance, energy consumption and, as a novelty, consecutive changes in the control inputs. Through a suitable system transformation, the optimal tracking problem is converted into a regulation problem with respect to the state tracking error. This leads to a novel finite-horizon performance index function and a corresponding nonlinear HJB equation, which is solved approximately using a novel iterative ADP-based algorithm. A suitable neural-network-based structure is proposed to learn the initial admissible one-step zero control law. The proposed iterative ADP is implemented using the heuristic dynamic programming technique based on an actor-critic neural network structure. Finally, simulation studies are presented to illustrate the effectiveness of the proposed algorithm.
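As a rough illustration of the three-term cost described in the abstract, the sketch below sums a quadratic stage cost over a finite horizon. The quadratic form, the scalar weights `q`, `r`, `s`, the zero initial previous control, and the function name `finite_horizon_cost` are all assumptions made for this example; the abstract does not give the exact performance index used in the paper.

```python
def finite_horizon_cost(errors, controls, q=1.0, r=1.0, s=1.0, u_prev=None):
    """Sum an assumed quadratic stage cost over a finite horizon.

    errors   : list of tracking-error vectors e_k (each a list of floats)
    controls : list of control vectors u_k (each a list of floats)
    q, r, s  : hypothetical scalar weights on tracking error, control
               energy, and consecutive control change, respectively
    u_prev   : control applied before the horizon starts (assumed zero)
    """
    if u_prev is None:
        u_prev = [0.0] * len(controls[0])
    total = 0.0
    for e, u in zip(errors, controls):
        # Change in control relative to the previous step.
        du = [ui - pi for ui, pi in zip(u, u_prev)]
        tracking = q * sum(x * x for x in e)   # tracking performance
        energy = r * sum(x * x for x in u)     # energy consumption
        change = s * sum(x * x for x in du)    # consecutive control change
        total += tracking + energy + change
        u_prev = u
    return total


# Two-step horizon with scalar state and control.
cost = finite_horizon_cost([[1.0], [0.5]], [[2.0], [1.0]])
print(cost)
```

Setting `s = 0` recovers a conventional cost with only the tracking and energy terms, which makes the effect of the control-change penalty easy to isolate.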

Learning-Based N-Step Heuristic Dynamic Programming for Affine Nonlinear Optimal Regulation

Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control

Event-Triggered Control of Nonlinear Discrete-Time System With Unknown Dynamics Based on HDP(λ)

Intelligent Optimal Control of Constrained Nonlinear Systems Via Receding-Horizon Heuristic Dynamic Programming

Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems

Approximately Optimal Control of Discrete-Time Nonlinear Switched Systems Using Globalized Dual Heuristic Programming

Direct Heuristic Dynamic Programming Based on an Improved PID Neural Network

Model‐free optimal tracking over finite horizon using adaptive dynamic programming

Discrete-Time Adaptive Iterative Learning Control for High-Order Nonlinear Systems with Unknown Control Directions

Convergence and Stability of Optimal Regulation via Generalized N-Step Value Gradient Learning

Approximate dynamic programming for continuous state and control problems

Optimal control of nonlinear system based on deterministic policy gradient with eligibility traces

Model-Free Incremental Adaptive Dynamic Programming Based Approximate Robust Optimal Regulation

Direct Heuristic Dynamic Programming with Augmented States

Novel iterative neural dynamic programming for data-based approximate optimal control design

Policy-Iteration-Based Finite-Horizon Approximate Dynamic Programming for Continuous-Time Nonlinear Optimal Control

A Digital Receding-Horizon Learning Controller for Nonlinear Continuous-time Systems

Adaptive neural event‐triggered near‐optimal control for affined uncertain nonlinear discrete‐time system

Data-Driven Tracking Control With Adaptive Dynamic Programming for a Class of Continuous-Time Nonlinear Systems