Abstract:Summary This article introduces a novel optimal trajectory tracking control scheme designed for uncertain linear discrete‐time (DT) systems. In contrast to traditional tracking control methods, our approach removes the requirement for the reference trajectory to align with the generator dynamics of an autonomous dynamical system. Moreover, it does not demand the complete desired trajectory to be known in advance, whether through the generator model or any other means. Instead, our approach can dynamically incorporate segments (finite horizons) of reference trajectories and autonomously learn an optimal control policy to track them in real time. To achieve this, we address the tracking problem by learning a time‐varying ‐function through state feedback. This ‐function is then utilized to calculate the optimal feedback gain and explicitly time‐varying feedforward control input, all without the need for prior knowledge of the system dynamics or having the complete reference trajectory in advance. Additionally, we introduce an adaptive observer to extend the applicability of the tracking control scheme to situations where full state measurements are unavailable. We rigorously establish the closed‐loop stability of our optimal adaptive control approach, both with and without the adaptive observer, employing Lyapunov theory. Moreover, we characterize the optimality of the controller with respect to the finite horizon length of the known components of the desired trajectory. To further enhance the controller's adaptability and effectiveness in multitask environments, we employ the Efficient Lifelong Learning Algorithm, which leverages a shared knowledge base within the recursive least squares algorithm for multitask ‐learning. The efficacy of our approach is substantiated through a comprehensive set of simulation results by using a power system example.

Off-Policy Reinforcement Learning for Tracking in Continuous-Time Systems on Two Time Scales

A Learning-Based Optimal Tracking Controller for Continuous Linear Systems with Unknown Dynamics: Theory and Case Study

Operational Optimal Tracking Control for Industrial Multirate Systems Subject to Unknown Disturbances

Reinforcement learning for optimal tracking of large-scale systems with multitime scales

Reinforcement Learning for Input Constrained Sub-optimal Tracking Control in Discrete-time Two-time-scale Systems

Fixed-Time Tracking Control for State-Constrained Nonstrict-Feedback Systems Without Feasibility Conditions

Asymptotic Tracking Controller Design for Nonlinear Systems with Guaranteed Performance.

Reinforcement Learning Tracking Control for Unknown Continuous Dynamic Systems

H ∞ Reference Tracking Control Design for a Class of Nonlinear Systems with Time-Varying Delays

Off-policy two-dimensional reinforcement learning for optimal tracking control of batch processes with network-induced dropout and disturbances

Robust Output Regulation and Reinforcement Learning-Based Output Tracking Design for Unknown Linear Discrete-Time Systems

Reinforcement Learning for Finite-Horizon H∞ Tracking Control of Unknown Discrete Linear Time-Varying System

Novel two-dimensional off-policy Q -learning method for output feedback optimal tracking control of batch process with unknown dynamics

Quadratic Tracking Control of Linear Stochastic Systems with Unknown Dynamics Using Average Off-Policy Q-Learning Method

Constrained Reinforcement Learning-Based Closed-Loop Reference Model for Optimal Tracking Control of Unknown Continuous-Time Systems

Novel data-driven two-dimensional Q-learning for optimal tracking control of batch process with unknown dynamics

Optimal trajectory tracking for uncertain linear discrete‐time systems using time‐varying Q‐learning

Two-Time Scale Tracking Control of Flexible Robots With Primal-Dual Inverse Reinforcement Learning

Robust Tracking Control for Nonlinear Systems: Performance optimization via extremum seeking

Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning

Learning-based optimal control of linear time-varying systems over large time intervals