Abstract: Adaptive dynamic programming (ADP) based approaches are effective for solving nonlinear Hamilton–Jacobi–Bellman (HJB) equations in an approximate sense. This paper develops a novel ADP‐based approach that focuses on minimizing the consecutive changes in control inputs over a finite horizon in order to solve the optimal tracking problem for completely unknown discrete‐time systems. To that end, the cost function takes into account tracking performance, energy consumption and, as a novelty, consecutive changes in the control inputs. Through a suitable system transformation, the optimal tracking problem is transformed into a regulation problem with respect to the state tracking error. The latter leads to a novel performance index function over a finite horizon and a corresponding nonlinear HJB equation, which is solved approximately using a novel iterative ADP‐based algorithm. A suitable neural network‐based structure is proposed to learn the initial admissible one step zero control law. The proposed iterative ADP is implemented using the heuristic dynamic programming technique based on an actor‐critic neural network structure. Finally, simulation studies are presented to illustrate the effectiveness of the proposed algorithm.
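
The abstract names three ingredients of the cost function (tracking performance, energy consumption, and consecutive changes in the control inputs) without giving its exact form. As a minimal sketch only, the following Python snippet assumes a quadratic stage cost with hypothetical weight matrices `Q`, `R` and `S`; the function names, the weights, and the quadratic form itself are illustrative assumptions, not the paper's actual performance index.

```python
def quad(v, W):
    """Quadratic form v^T W v for a list vector v and a list-of-lists matrix W."""
    n = len(v)
    return sum(v[i] * W[i][j] * v[j] for i in range(n) for j in range(n))


def finite_horizon_cost(errors, controls, Q, R, S, u_prev=None):
    """Sum an assumed quadratic stage cost over a finite horizon.

    errors:   list of state tracking errors e_k (one list per time step)
    controls: list of control inputs u_k (one list per time step)
    Q, R, S:  hypothetical weights on tracking error, control energy,
              and consecutive control changes (assumed, not from the paper)
    u_prev:   control applied just before the horizon (defaults to zero)
    """
    if u_prev is None:
        u_prev = [0.0] * len(controls[0])
    total = 0.0
    for e, u in zip(errors, controls):
        du = [a - b for a, b in zip(u, u_prev)]  # consecutive change in control
        total += quad(e, Q) + quad(u, R) + quad(du, S)
        u_prev = u
    return total


# Two control sequences with identical energy but different smoothness.
zero_errors = [[0.0]] * 4
Q, R, S = [[1.0]], [[0.1]], [[1.0]]
steady_cost = finite_horizon_cost(zero_errors, [[1.0]] * 4, Q, R, S)
jumpy_cost = finite_horizon_cost(
    zero_errors, [[1.0], [-1.0], [1.0], [-1.0]], Q, R, S
)
print(steady_cost, jumpy_cost)
```

Under these assumed weights, both sequences spend the same control energy, yet the alternating sequence is penalized far more heavily because of the control-change term. This is the kind of behavior a cost of this shape is meant to discourage.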

Model-based reinforcement learning for infinite-horizon approximate optimal tracking

A Learning-Based Optimal Tracking Controller for Continuous Linear Systems with Unknown Dynamics: Theory and Case Study

Efficient model-based reinforcement learning for approximate online optimal

Online reinforcement learning control of unknown nonaffine nonlinear discrete time systems

Data-based reinforcement learning approximate optimal control for an uncertain nonlinear system with control effectiveness faults

Reinforcement Learning for Finite-Horizon H∞ Tracking Control of Unknown Discrete Linear Time-Varying System

Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks

Data-Driven Near-Optimal Control of Nonlinear Systems Over Finite Horizon

Adaptive critic-based tracking control of non-affine nonlinear discrete-time systems with unknown dynamics

Data-Efficient Off-Policy Learning for Distributed Optimal Tracking Control of HMAS with Unidentified Exosystem Dynamics

Online Multi-Objective Model-Independent Adaptive Tracking Mechanism for Dynamical Systems

Adaptive Learning-Based Path-Tracking Control for Unknown Vehicle Systems under Performance Optimization

Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems Using Online Approximators

Online inverse reinforcement learning with unknown disturbances

On optimal tracking portfolio in incomplete markets: The reinforcement learning approach

Constrained Reinforcement Learning-Based Closed-Loop Reference Model for Optimal Tracking Control of Unknown Continuous-Time Systems

Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning

Integral reinforcement learning‐based optimal tracking control for uncertain nonlinear systems under input constraint and specified performance constraints

Model‐free optimal tracking over finite horizon using adaptive dynamic programming

Model-Free Optimal Tracking Design With Evolving Control Strategies via Q-Learning

Model-based inverse reinforcement learning for deterministic systems