On Provably Robust Meta-Bayesian Optimization

Zhongxiang Dai,Yizhou Chen,Haibin Yu,Bryan Kian Hsiang Low,Patrick Jaillet
DOI: https://doi.org/10.48550/arXiv.2206.06872
2022-06-16
Abstract:Bayesian optimization (BO) has become popular for sequential optimization of black-box functions. When BO is used to optimize a target function, we often have access to previous evaluations of potentially related functions. This begs the question as to whether we can leverage these previous experiences to accelerate the current BO task through meta-learning (meta-BO), while ensuring robustness against potentially harmful dissimilar tasks that could sabotage the convergence of BO. This paper introduces two scalable and provably robust meta-BO algorithms: robust meta-Gaussian process-upper confidence bound (RM-GP-UCB) and RM-GP-Thompson sampling (RM-GP-TS). We prove that both algorithms are asymptotically no-regret even when some or all previous tasks are dissimilar to the current task, and show that RM-GP-UCB enjoys a better theoretical robustness than RM-GP-TS. We also exploit the theoretical guarantees to optimize the weights assigned to individual previous tasks through regret minimization via online learning, which diminishes the impact of dissimilar tasks and hence further enhances the robustness. Empirical evaluations show that (a) RM-GP-UCB performs effectively and consistently across various applications, and (b) RM-GP-TS, despite being less robust than RM-GP-UCB both in theory and in practice, performs competitively in some scenarios with less dissimilar tasks and is more computationally efficient.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that when using Bayesian Optimization (BO) to optimize the objective function, how to utilize the experience of previously evaluated potentially related functions to accelerate the current BO task, while ensuring robustness against potentially harmful different tasks to prevent these different tasks from disrupting the convergence of BO. Specifically, the paper proposes two scalable and theoretically guaranteed robust meta - Bayesian optimization algorithms: the Robust Meta - Gaussian Process Upper Confidence Bound (RM - GP - UCB) and the Robust Meta - Gaussian Process Thompson Sampling (RM - GP - TS). These two algorithms can guarantee asymptotically no - regret even when some or all of the previous tasks are not similar to the current task, and optimize the weights assigned to each previous task by minimizing the regret upper bound through online learning, thereby reducing the influence of different tasks and further enhancing robustness.