Batched Gaussian Process Bandit Optimization via Determinantal Point Processes

Tarun Kathuria, Amit Deshpande, Pushmeet Kohli

DOI: https://doi.org/10.48550/arXiv.1611.04088

2016-11-13

Abstract: Gaussian Process bandit optimization has emerged as a powerful tool for optimizing noisy black box functions. One example in machine learning is hyper-parameter optimization where each evaluation of the target function requires training a model which may involve days or even weeks of computation. Most methods for this so-called "Bayesian optimization" only allow sequential exploration of the parameter space. However, it is often desirable to propose batches or sets of parameter values to explore simultaneously, especially when there are large parallel processing facilities at our disposal. Batch methods require modeling the interaction between the different evaluations in the batch, which can be expensive in complex scenarios. In this paper, we propose a new approach for parallelizing Bayesian optimization by modeling the diversity of a batch via Determinantal point processes (DPPs) whose kernels are learned automatically. This allows us to generalize a previous result as well as prove better regret bounds based on DPP sampling. Our experiments on a variety of synthetic and real-world robotics and hyper-parameter optimization tasks indicate that our DPP-based methods, especially those based on DPP sampling, outperform state-of-the-art methods.

Machine Learning

What problem does this paper attempt to address?

This paper addresses how to explore in batches during Bayesian optimization, particularly when abundant parallel computing resources are available. Most Bayesian optimization methods evaluate one parameter value at a time, which is inefficient when each evaluation is slow. In hyper-parameter optimization for machine learning, for example, a single evaluation of the objective function may take days or even weeks, so proposing several parameter values to evaluate simultaneously is valuable. Batch methods, however, must model the interactions between the evaluations in a batch, which can be costly in complex scenarios. To address this, the paper selects diverse batches using Determinantal Point Processes (DPPs), whose kernels are learned automatically. DPPs are probability measures over subsets that favor diverse selections and admit efficient sampling algorithms. According to the abstract, this approach generalizes a previous result and yields better regret bounds based on DPP sampling. The reported experiments indicate that the DPP-based methods, especially those based on DPP sampling, outperform state-of-the-art methods on a variety of synthetic and real-world robotics and hyper-parameter optimization tasks.
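To make the diversity idea concrete, here is a minimal Python sketch of how a determinant-based criterion can pick a spread-out batch. It is not the paper's algorithm: it assumes a fixed RBF kernel with a hand-picked lengthscale (the paper learns its kernels automatically), and it uses greedy maximization of the log-determinant of the kernel submatrix rather than DPP sampling. The names `rbf_kernel` and `greedy_dpp_batch` are hypothetical helpers introduced only for this illustration.

```python
import numpy as np


def rbf_kernel(X, lengthscale=0.2):
    """Squared-exponential similarity matrix (assumed kernel, for illustration)."""
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    return np.exp(-0.5 * sq_dists / lengthscale ** 2)


def greedy_dpp_batch(K, batch_size, jitter=1e-6):
    """Greedily add the item that most increases log det of the selected submatrix.

    A large determinant means the selected points are mutually dissimilar,
    which is the property a DPP rewards.
    """
    n = K.shape[0]
    selected = []
    for _ in range(batch_size):
        best_idx, best_logdet = None, -np.inf
        for i in range(n):
            if i in selected:
                continue
            idx = selected + [i]
            # Small jitter on the diagonal keeps the submatrix well conditioned.
            sub = K[np.ix_(idx, idx)] + jitter * np.eye(len(idx))
            sign, logdet = np.linalg.slogdet(sub)
            if sign > 0 and logdet > best_logdet:
                best_idx, best_logdet = i, logdet
        selected.append(best_idx)
    return selected


# Hypothetical candidate set: 21 evenly spaced points on [0, 1].
candidates = np.linspace(0.0, 1.0, 21).reshape(-1, 1)
K = rbf_kernel(candidates)
batch = greedy_dpp_batch(K, batch_size=3)
print(candidates[batch].ravel())
```

On this toy grid the selected points are spread across the interval rather than clustered, because nearby points have high kernel similarity and therefore shrink the determinant. In a bandit setting the kernel would instead come from the Gaussian Process model, so that the batch reflects what is still uncertain.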

Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification

Gaussian Process Bandit Optimization with Few Batches

Diversified Sampling for Batched Bayesian Optimization with Determinantal Point Processes

Parallelizing Exploration-Exploitation Tradeoffs with Gaussian Process Bandit Optimization

Provably Efficient Bayesian Optimization with Unbiased Gaussian Process Hyperparameter Estimation

Gaussian One-Armed Bandit and Optimization of Batch Data Processing

On Batch Bayesian Optimization

Differentiating the multipoint Expected Improvement for optimal batch design

Bayesian Optimization with High-Dimensional Outputs

Distributed Gaussian Processes Hyperparameter Optimization for Big Data Using Proximal ADMM

Parallel Gaussian Process Optimization with Upper Confidence Bound and Pure Exploration

Batch Bayesian Optimization via Particle Gradient Flows

Scalable Bayesian Optimization Using Vecchia Approximations of Gaussian Processes

Hyper-optimization with Gaussian Process and Differential Evolution Algorithm

Vector Optimization with Gaussian Process Bandits

Time-Varying Gaussian Process Bandits with Unknown Prior

Random Exploration in Bayesian Optimization: Order-Optimal Regret and Computational Efficiency

Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent

Optimal Designs of Gaussian Processes with Budgets for Hyperparameter Optimization

Sample-Then-Optimize Batch Neural Thompson Sampling