Principled Bayesian Optimisation in Collaboration with Human Experts

Wenjie Xu, Masaki Adachi, Colin N. Jones, Michael A. Osborne
2024-10-14
Abstract: Bayesian optimisation for real-world problems is often performed interactively with human experts, and integrating their domain knowledge is key to accelerating the optimisation process. We consider a setup where experts provide advice on the next query point through binary accept/reject recommendations (labels). Experts' labels are often costly, requiring efficient use of their efforts, and can at the same time be unreliable, requiring careful adjustment of the degree to which any expert is trusted. We introduce the first principled approach that provides two key guarantees. (1) Handover guarantee: similar to a no-regret property, we establish a sublinear bound on the cumulative number of experts' binary labels. Initially, multiple labels per query are needed, but the number of expert labels required asymptotically converges to zero, saving both expert effort and computation time. (2) No-harm guarantee with data-driven trust level adjustment: our adaptive trust level ensures that the convergence rate will not be worse than the one without using advice, even if the advice from experts is adversarial. Unlike existing methods that employ a user-defined function that hand-tunes the trust level adjustment, our approach enables data-driven adjustments. Real-world applications empirically demonstrate that our method not only outperforms existing baselines, but also maintains robustness despite varying labelling accuracy, in tasks of battery design with human experts.
Machine Learning, Optimization and Control
What problem does this paper attempt to address?
This paper addresses several problems that arise when Bayesian optimisation (BO) is run in collaboration with human experts in practical applications:

1. **Effective utilization of expert advice**: How to integrate the domain knowledge of human experts so that it accelerates the optimisation process. Experts' knowledge is often qualitative, difficult to express accurately in a functional form, and may be inaccurate.
2. **Reducing the cost of expert participation**: The binary accept/reject labels provided by experts are costly, so expert effort must be used efficiently and dependence on experts kept to a minimum.
3. **Handling unreliable expert opinions**: Expert opinions may be unreliable or even adversarial, so a mechanism is needed to adjust the level of trust placed in experts and ensure that wrong advice does not harm the optimisation.

To this end, the authors propose a principled method with two key guarantees:

- **Handover guarantee**: The cumulative number of expert binary labels is bounded sublinearly. Multiple labels per query are needed initially, but the number of labels required converges asymptotically to zero, saving both expert effort and computation time.
- **No-harm guarantee**: Even if the experts' advice is wrong or adversarial, the convergence rate of the algorithm is no worse than that of Bayesian optimisation without expert advice. The trust level is adjusted in a data-driven way rather than through a user-defined, hand-tuned function.

In addition, the method was evaluated on real-world battery design tasks with human experts, including lithium-ion battery design. The results show that it outperforms existing baseline methods and remains robust under varying labelling accuracy.
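The two guarantees can be written out in symbols. The following is a minimal sketch in assumed notation, not the paper's own statement: $q_t$, $Q_T$, $R_T^{\text{advice}}$ and $R_T^{\text{plain}}$ are names introduced here for illustration, and the exact form of the bounds in the paper may differ.

```latex
% Assumed notation (not taken from the paper):
%   q_t : number of expert labels requested at round t
%   Q_T : cumulative number of expert labels after T rounds
% Handover guarantee, read as sublinear growth of Q_T:
Q_T = \sum_{t=1}^{T} q_t ,
\qquad
\lim_{T \to \infty} \frac{Q_T}{T} = 0 .

% Assumed notation (not taken from the paper):
%   R_T^{advice} : cumulative regret when expert labels are used
%   R_T^{plain}  : cumulative regret of BO without expert advice
% No-harm guarantee, read as "no worse in rate", even for adversarial labels:
R_T^{\text{advice}} = \mathcal{O}\!\left( R_T^{\text{plain}} \right) .
```

Under this reading, the first line says that the average number of labels per round vanishes as the run gets longer, which is what makes the expert's involvement taper off. The second line is one way to formalise "the convergence rate will not be worse than the one without using advice"; it compares rates only and makes no claim about constants.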