Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control

Sebastian Hirt, Lukas Theiner, Rolf Findeisen

2024-12-03

Abstract: Closed-loop performance of sequential decision making algorithms, such as model predictive control, depends strongly on the parameters of cost functions, models, and constraints. Bayesian optimization is a common approach to learning these parameters based on closed-loop experiments. However, traditional Bayesian optimization approaches treat the learning problem as a black box, ignoring valuable information and knowledge about the structure of the underlying problem, resulting in slow convergence and high experimental resource use. We propose a time-series-informed optimization framework that incorporates intermediate performance evaluations from early iterations of each experimental episode into the learning procedure. Additionally, probabilistic early stopping criteria are proposed to terminate unpromising experiments, significantly reducing experimental time. Simulation results show that our approach achieves baseline performance with approximately half the resources. Moreover, with the same resource budget, our approach outperforms the baseline in terms of final closed-loop performance, highlighting its efficiency in sequential decision making scenarios.

Systems and Control, Machine Learning

What problem does this paper attempt to address?

This paper addresses the strong dependence of closed-loop performance in sequential decision making and control on the parameters of cost functions, models, and constraints. When learning these parameters, traditional Bayesian optimization methods ignore valuable information about the structure of the underlying problem, which results in slow convergence and high use of experimental resources. To address this, the authors propose a time-series-informed optimization framework that incorporates intermediate performance evaluations from the early iterations of each experiment into the learning process. They also propose probabilistic early-stopping criteria that terminate unpromising experiments, significantly reducing experimental time. Simulation results show that the method reaches baseline performance with approximately half of the resources and that, with the same resource budget, its final closed-loop performance is better than that of the baseline method, highlighting its efficiency in sequential decision-making scenarios.

Specifically, the main contributions of the paper include:

1. **Bayesian Optimization with Time-Series Information (TSI-BO)**: Aligns the fidelity dimension of the surrogate model with the time axis of the closed-loop experiment.
2. **Probabilistic Decision Criteria Based on Upper Confidence Bound (UCB) and Expected Improvement (EI)**: Used for early stopping of unpromising experiments.
3. **Convergence-Based Stopping Criterion**: Uses information from the closed-loop trajectory to decide whether to terminate the experiment.

Together, these methods improve the convergence speed, resource efficiency, and closed-loop performance of multi-fidelity Bayesian optimization when it is applied to closed-loop performance optimization.
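To make the idea of probabilistic early stopping more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that a surrogate model supplies a Gaussian prediction (mean and standard deviation) of the final closed-loop cost of a running experiment, and that lower cost is better, so the optimistic bound is a lower confidence bound (the minimization counterpart of UCB). The function names and the parameters `beta` and `ei_tol` are hypothetical and chosen only for illustration.

```python
import math


def expected_improvement(mean, std, best_cost):
    """Expected improvement over best_cost for a Gaussian cost prediction (minimization)."""
    if std <= 0.0:
        # Degenerate prediction: improvement is deterministic.
        return max(best_cost - mean, 0.0)
    z = (best_cost - mean) / std
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))          # standard normal CDF
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)   # standard normal PDF
    return (best_cost - mean) * cdf + std * pdf


def stop_by_confidence_bound(mean, std, best_cost, beta=2.0):
    """Stop if even the optimistic (lower) bound on the final cost exceeds the best cost so far."""
    optimistic_cost = mean - beta * std
    return optimistic_cost > best_cost


def stop_by_expected_improvement(mean, std, best_cost, ei_tol=1e-3):
    """Stop if the expected improvement of finishing the experiment is negligible."""
    return expected_improvement(mean, std, best_cost) < ei_tol


if __name__ == "__main__":
    # Hypothetical numbers: the surrogate predicts a final cost of 10 +/- 1,
    # while the best completed experiment so far achieved a cost of 5.
    print(stop_by_confidence_bound(10.0, 1.0, best_cost=5.0))
    print(stop_by_expected_improvement(10.0, 1.0, best_cost=5.0))
```

In a closed-loop experiment, such a check could be evaluated at intermediate time steps, with the prediction updated as more of the trajectory is observed. How the paper's criteria are defined in detail is not stated in this summary.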

Novel Robust Predictive Control Algorithm Based on Closed-Loop Optimization

Safe and Stable Closed-Loop Learning for Neural-Network-Supported Model Predictive Control

Stability-informed Bayesian Optimization for MPC Cost Function Learning

Closed-Loop Finite-Time Analysis of Suboptimal Online Control

Machine learning based decision making for time varying systems: Parameter estimation and performance optimization

Optimizing Closed-Loop Performance with Data from Similar Systems: A Bayesian Meta-Learning Approach

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

Probabilistic design of optimal sequential decision-making algorithms in learning and control

On the Finite-Time Behavior of Suboptimal Linear Model Predictive Control

Learning-Based Optimal Control with Performance Guarantees for Unknown Systems with Latent States

Metacontrol for Adaptive Imagination-Based Optimization

Using Bayesian Optimization to Design Time Step Size Controllers with Application to Modified Patankar--Runge--Kutta Methods

Sequential learning and control: Targeted exploration for robust performance

Data-Driven Near-Optimal Control of Nonlinear Systems Over Finite Horizon

Bayesian Learning Approach to Model Predictive Control

Closed-loop Analysis of ADMM-based Suboptimal Linear Model Predictive Control

Optimal Exploration for Model-Based RL in Nonlinear Systems

Temporal Difference Learning for Model Predictive Control

Actively Learning Reinforcement Learning: A Stochastic Optimal Control Approach

Knowledge-based modeling of simulation behavior for Bayesian optimization