Batched Gaussian Process Bandit Optimization via Determinantal Point Processes

Tarun Kathuria,Amit Deshpande,Pushmeet Kohli
DOI: https://doi.org/10.48550/arXiv.1611.04088
2016-11-13
Abstract:Gaussian Process bandit optimization has emerged as a powerful tool for optimizing noisy black box functions. One example in machine learning is hyper-parameter optimization where each evaluation of the target function requires training a model which may involve days or even weeks of computation. Most methods for this so-called "Bayesian optimization" only allow sequential exploration of the parameter space. However, it is often desirable to propose batches or sets of parameter values to explore simultaneously, especially when there are large parallel processing facilities at our disposal. Batch methods require modeling the interaction between the different evaluations in the batch, which can be expensive in complex scenarios. In this paper, we propose a new approach for parallelizing Bayesian optimization by modeling the diversity of a batch via Determinantal point processes (DPPs) whose kernels are learned automatically. This allows us to generalize a previous result as well as prove better regret bounds based on DPP sampling. Our experiments on a variety of synthetic and real-world robotics and hyper-parameter optimization tasks indicate that our DPP-based methods, especially those based on DPP sampling, outperform state-of-the-art methods.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively conduct batch exploration in Bayesian optimization, especially in scenarios with abundant parallel computing resources. Traditional Bayesian optimization methods usually explore only one parameter value at a time, which is inefficient when a large amount of time is required for evaluation. For example, in the hyper - parameter optimization tasks in machine learning, each evaluation of the objective function may take days or even weeks. Therefore, it becomes particularly important to propose batch methods to explore multiple parameter values simultaneously. However, batch methods need to model the interactions between different evaluations, which can be very costly in complex scenarios. To overcome these challenges, this paper proposes a new method to select diverse batch sample sets by using Determinantal Point Processes (DPPs). DPPs are a kind of probability measure that can promote the diversity of the selected subsets and have efficient sampling algorithms. This method can not only generalize previous results but also prove better regret bounds based on DPP sampling. Experimental results show that DPP - based methods, especially those based on DPP sampling, outperform the existing state - of - the - art methods in various synthetic and real - world robotics and hyper - parameter optimization tasks.