Abstract:Optimizing expensive-to-evaluate black-box functions of discrete (and potentially continuous) design parameters is a ubiquitous problem in scientific and engineering applications. Bayesian optimization (BO) is a popular, sample-efficient method that leverages a probabilistic surrogate model and an acquisition function (AF) to select promising designs to evaluate. However, maximizing the AF over mixed or high-cardinality discrete search spaces is challenging standard gradient-based methods cannot be used directly or evaluating the AF at every point in the search space would be computationally prohibitive. To address this issue, we propose using probabilistic reparameterization (PR). Instead of directly optimizing the AF over the search space containing discrete parameters, we instead maximize the expectation of the AF over a probability distribution defined by continuous parameters. We prove that under suitable reparameterizations, the BO policy that maximizes the probabilistic objective is the same as that which maximizes the AF, and therefore, PR enjoys the same regret bounds as the original BO policy using the underlying AF. Moreover, our approach provably converges to a stationary point of the probabilistic objective under gradient ascent using scalable, unbiased estimators of both the probabilistic objective and its gradient. Therefore, as the number of starting points and gradient steps increase, our approach will recover of a maximizer of the AF (an often-neglected requisite for commonly used BO regret bounds). We validate our approach empirically and demonstrate state-of-the-art optimization performance on a wide range of real-world applications. PR is complementary to (and benefits) recent work and naturally generalizes to settings with multiple objectives and black-box constraints.

RLBOF: Reinforcement Learning from Bayesian Optimization Feedback

Bayesian Optimization Based on Pseudo Labels

Behavior Proximal Policy Optimization

MALIBO: Meta-learning for Likelihood-free Bayesian Optimization

Large Language Models to Enhance Bayesian Optimization

Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning

On Provably Robust Meta-Bayesian Optimization

Non-myopic Bayesian optimization using model-free reinforcement learning and its application to optimization in electrochemistry

Enhanced Bayesian Optimization via Preferential Modeling of Abstract Properties

Transfer Learning for Bayesian Optimization: A Survey

Modulating Surrogates for Bayesian Optimization

Approximating Pareto Frontier Through Bayesian-optimization-directed Robust Multi-objective Reinforcement Learning

Reinforced In-Context Black-Box Optimization

Bayesian Optimization over Discrete and Mixed Spaces via Probabilistic Reparameterization

Poisson Process for Bayesian Optimization

Deep Kernel Learning-Based Bayesian Optimization with Adaptive Kernel Functions

Optimizing Closed-Loop Performance with Data from Similar Systems: A Bayesian Meta-Learning Approach

Doubly Bayesian Optimization

Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment

How Useful is Intermittent, Asynchronous Expert Feedback for Bayesian Optimization?

Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator