Abstract:The paper proposes a method to reduce MPC computation by shortening the prediction horizon using an approximate value function. A goal‐oriented sampling strategy is introduced that incorporates function values and gradients to train the value function, enabling efficient online optimization. Model predictive control (MPC) is a well‐developed method capable of handling complex control tasks. Implementation of an MPC requires solution of a deterministic finite horizon nonlinear optimal control (OC) problem. OC problems can be solved globally, explicitly expressing the optimal policy (and value function) as a function of the present state, or, locally, using online trajectory optimization, generating a solution that is only valid for the present state. For nonlinear problems however, neither is possible analytically, and the latter, finding a local solution online, is usually preferred. The difficulty with online trajectory optimization is that the solution must be available within a single sampling period. The main parameter affecting the computational demand of MPC is the length of the prediction horizon. The goal in this work is to reduce the length of the prediction horizon of a model predictive controller (MPC) to reduce computation time whilst preserving optimality guarantees. To this end, we propose approximation of the trajectory optimization problem for MPC by learning a finite horizon value function. The approximated value function is inserted into a truncated trajectory optimization problem so that the MPC can be attained with a reduced prediction horizon, ergo reduced computational load. By sampling the value function in a goal oriented way, we show that an effective approximate value function can be found by including both the function value and the gradients of the value function. The result is an accurate approximate MPC which leverages learning methodologies to reduce computational cost while still accounting for constraints.

Value Approximator-based Learning Model Predictive Control for Iterative Tasks

Learning‐based Model Predictive Control under Value Iteration with Finite Approximation Errors

Nontracking type iterative learning control based on economic model predictive control

Learning-based adaptive optimal control of linear time-delay systems: A value iteration approach

A High-Order Internal Model Based Iterative Learning Control Scheme for Discrete Linear Time-Varying Systems

Iterative Learning Control Of Varying Trajectories For Robot Manipulators

Predictive Control with Learning-Based Terminal Costs Using Approximate Value Iteration

Iterative learning controllers with time-varying gains for large-scale industrial processes to track trajectories with different magnitudes

Stability Analysis of Optimal Adaptive Control using Value Iteration with Approximation Errors

Linear Quadratic Tracking Control of Unknown Discrete-Time Systems Using Value Iteration Algorithm

Manifold Regularization Based Approximate Value Iteration For Learning Control

Approximate infinite-horizon predictive control

Iterative Learning Control for Linear Time-Variant Continuous Systems with Iteration-Varying Initial Conditions and Iteration-Varying Reference Trajectories.

Robust Predictive Iterative Learning Control for Linear Time‐varying Systems

Iterative Learning Economic Model Predictive Control.

General value iteration based single network approach for constrained optimal controller design of partially-unknown continuous-time nonlinear systems.

An Iterative Learning Control Algorithm Based on Predictive Model

Fast Nonlinear Model Predictive Control Combining Online Trajectory Optimization and Value Function Regression

Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems

Model Reference Adaptive Iterative Learning Control for a Class of Time-varying Systems

Adaptive Iterative Learning Control in Optimization of Industrial Process