Abstract: This article introduces a novel optimal trajectory tracking control scheme for uncertain linear discrete‐time (DT) systems. In contrast to traditional tracking control methods, our approach does not require the reference trajectory to be generated by the dynamics of an autonomous dynamical system. Nor does it require the complete desired trajectory to be known in advance, whether through a generator model or any other means. Instead, it can dynamically incorporate segments (finite horizons) of reference trajectories and autonomously learn an optimal control policy to track them in real time. To achieve this, we address the tracking problem by learning a time‐varying Q‐function through state feedback. This Q‐function is then used to calculate the optimal feedback gain and an explicitly time‐varying feedforward control input, without prior knowledge of the system dynamics and without the complete reference trajectory. Additionally, we introduce an adaptive observer to extend the tracking control scheme to situations where full state measurements are unavailable. Using Lyapunov theory, we rigorously establish the closed‐loop stability of our optimal adaptive control approach, both with and without the adaptive observer. Moreover, we characterize the optimality of the controller with respect to the finite horizon length of the known components of the desired trajectory. To further enhance the controller's adaptability and effectiveness in multitask environments, we employ the Efficient Lifelong Learning Algorithm, which leverages a shared knowledge base within the recursive least squares algorithm for multitask Q‐learning. The efficacy of our approach is substantiated through a comprehensive set of simulation results using a power system example.
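As a rough illustration of how a feedback gain and a feedforward input can be read off a time‐varying quadratic Q‐function, the sketch below assumes a quadratic tracking cost and writes Q_k(x, u) = zᵀ H_k z with z = [x; u; 1]. The function names (`q_matrices`, `policy_from_H`) and the cost weights `Qc` and `R` are hypothetical, introduced only for this example. The abstract describes learning the Q‐function from data without a model; here a known model (A, B) is assumed purely to generate H_k by backward recursion, so this is not the article's learning algorithm.

```python
import numpy as np

def policy_from_H(H, n, m):
    """Minimize z^T H z over u, with z = [x; u; 1].

    Returns K (feedback gain) and d (feedforward) so that u = K x + d.
    """
    H_uu = H[n:n + m, n:n + m]
    H_ux = H[n:n + m, :n]
    H_u1 = H[n:n + m, n + m]
    K = -np.linalg.solve(H_uu, H_ux)
    d = -np.linalg.solve(H_uu, H_u1)
    return K, d

def q_matrices(A, B, Qc, R, refs):
    """Backward recursion for H_k over a finite reference segment refs[0..N].

    Assumed stage cost: (x - r_k)^T Qc (x - r_k) + u^T R u, with terminal
    cost (x - r_N)^T Qc (x - r_N). The model (A, B) is assumed known here.
    """
    n, m = B.shape
    N = len(refs) - 1

    def cost_block(r):
        # (x - r)^T Qc (x - r) written as [x; 1]^T C [x; 1]
        q = Qc @ r
        return np.block([[Qc, -q[:, None]],
                         [-q[None, :], np.array([[r @ q]])]])

    # Augmented dynamics: [x_{k+1}; 1] = F [x_k; u_k; 1]
    F = np.block([[A, B, np.zeros((n, 1))],
                  [np.zeros((1, n + m)), np.ones((1, 1))]])
    P = cost_block(refs[N])          # terminal value function
    idx = list(range(n)) + [n + m]   # positions of x and the constant 1 in z
    H_list = []
    for k in range(N - 1, -1, -1):
        S = np.zeros((n + m + 1, n + m + 1))
        S[np.ix_(idx, idx)] = cost_block(refs[k])
        S[n:n + m, n:n + m] = R
        H = S + F.T @ P @ F          # Q-function matrix at step k
        K, d = policy_from_H(H, n, m)
        # Substitute u = K x + d to get the value function at step k
        G = np.block([[np.eye(n), np.zeros((n, 1))],
                      [K, d[:, None]],
                      [np.zeros((1, n)), np.ones((1, 1))]])
        P = G.T @ H @ G
        H_list.append(H)
    return H_list[::-1]
```

For a scalar integrator (A = B = Qc = R = 1) with the reference segment [0, 2], minimizing u² + (x + u − 2)² by hand gives u = −0.5x + 1, which is what `policy_from_H` returns for the first matrix produced by `q_matrices`.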

Optimal tracking control of batch processes with time-invariant state delay: Adaptive Q-learning with two-dimensional state and control policy

Novel data-driven two-dimensional Q-learning for optimal tracking control of batch process with unknown dynamics

Novel two-dimensional off-policy Q-learning method for output feedback optimal tracking control of batch process with unknown dynamics

A Learning-Based Optimal Tracking Controller for Continuous Linear Systems with Unknown Dynamics: Theory and Case Study

Iterative Learning Tracking Control of High-Speed Trains with Nonlinearly Parameterized Uncertainties and Multiple Time-Varying Delays

Two-dimensional model-free Q-learning-based output feedback fault-tolerant control for batch processes

Adaptive Learning-Based Path-Tracking Control for Unknown Vehicle Systems under Performance Optimization

Off-policy two-dimensional reinforcement learning for optimal tracking control of batch processes with network-induced dropout and disturbances

Robust two-dimensional iterative learning control for batch processes with state delay and time-varying uncertainties

A Hybrid 2D Fault-Tolerant Controller Design for Multi-Phase Batch Processes with Time Delay

Data-Efficient Constrained Learning for Optimal Tracking of Batch Processes

Model-Free Optimal Tracking Design With Evolving Control Strategies via Q-Learning

Terminal Constrained Robust Hybrid Iterative Learning Model Predictive Control for Complex Time-Delayed Batch Processes

Delay-range-dependent robust 2D iterative learning control for batch processes with state delay and uncertainties

Adaptive Finite-Time-Based Neural Optimal Control of Time-Delayed Wheeled Mobile Robotics Systems

Optimal trajectory tracking for uncertain linear discrete‐time systems using time‐varying Q‐learning

Two-Dimensional Iterative Learning Model Predictive Control for Batch Processes: A New State Space Model Compensation Approach

2-D theory based integrated predictive iterative learning control for batch process

Integrated Tracking Control For Batch Processes In The Presence Of Model Uncertainties

Optimal Iterative Learning Control for Batch Processes Based on Linear Time-varying Perturbation Model

Delay‐range‐dependent Guaranteed Cost Control for Batch Processes with State Delay