Abstract:Sampling-based methods have become a cornerstone of contemporary approaches to Model Predictive Control (MPC), as they make no restrictions on the differentiability of the dynamics or cost function and are straightforward to parallelize. However, their efficacy is highly dependent on the quality of the sampling distribution itself, which is often assumed to be simple, like a Gaussian. This restriction can result in samples which are far from optimal, leading to poor performance. Recent work has explored improving the performance of MPC by sampling in a learned latent space of controls. However, these methods ultimately perform all MPC parameter updates and warm-starting between time steps in the control space. This requires us to rely on a number of heuristics for generating samples and updating the distribution and may lead to sub-optimal performance. Instead, we propose to carry out all operations in the latent space, allowing us to take full advantage of the learned distribution. Specifically, we frame the learning problem as bi-level optimization and show how to train the controller with backpropagation-through-time. By using a normalizing flow parameterization of the distribution, we can leverage its tractable density to avoid requiring differentiability of the dynamics and cost function. Finally, we evaluate the proposed approach on simulated robotics tasks and demonstrate its ability to surpass the performance of prior methods and scale better with a reduced number of samples.

Temporal Difference Learning for Model Predictive Control

Learning-Based Hierarchical Model Predictive Control for Drift Vehicles

Triple-Mode Model Predictive Control Using Future Target Information

Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Recursive model predictive control for fast varying dynamic systems

Learning-Based Neural Dynamic Surface Predictive Control for MMC

Lebesgue-Approximation Model Predictive Control of Nonlinear Sampled-Data Systems

Learning Sampling Distributions for Model Predictive Control

DiffTune-MPC: Closed-Loop Learning for Model Predictive Control

Per-decision Multi-step Temporal Difference Learning with Control Variates

Model Predictive Control with Variational Autoencoders for Signal Temporal Logic Specifications

Goal-Conditioned Terminal Value Estimation for Real-time and Multi-task Model Predictive Control

Data-Driven Multi-Modal Learning Model Predictive Control

Discrete-time Finite Horizon Adaptive Dynamic Programming for Autonomous Vehicle Control

Differentiable MPC for End-to-end Planning and Control

Fast Trajectory Tracking Control Algorithm for Autonomous Vehicles Based on the Alternating Direction Multiplier Method (ADMM) to the Receding Optimization of Model Predictive Control (MPC)

Model Controlled Prediction: A Reciprocal Alternative of Model Predictive Control

Learn Proportional Derivative Controllable Latent Space from Pixels

MPC-TD3 Trajectory Tracking Control for Electrically Driven Unmanned Tracked Vehicles

Synthesis of model predictive control based on data-driven learning

Robust and efficient data-driven predictive control