Abstract:Expected Improvement (EI) is arguably the most widely used acquisition function in Bayesian optimization. However, it is often challenging to enhance the performance with EI due to its sensitivity to numerical precision. Previously, Hutter et al. (2009) tackled this problem by using Gaussian process trained on the log-transformed objective function and it was reported that this trick improves the predictive accuracy of GP, leading to substantially better performance. Although Hutter et al. (2009) offered the closed form of their EI, its intermediate derivation has not been provided so far. In this paper, we give a friendly derivation of their proposition.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to provide the closed - form solution derivation of Expected Improvement (EI), especially for Gaussian Process (GP) trained with a log - transformed objective function. Specifically: 1. **Problem Background**: - Expected Improvement (EI) is one of the most commonly used acquisition functions in Bayesian optimization. - However, due to its sensitivity to numerical precision, improving the performance of EI is often challenging. - Hutter et al. (2009) solved this problem by training GP with a log - transformed objective function and reported that this method significantly improved the prediction accuracy of GP, thus enhancing the performance. 2. **Unsolved Problems**: - Although Hutter et al. (2009) provided the closed - form solution of EI, its derivation process was not given in detail. 3. **Main Contributions of the Paper**: - The author of this paper provided the detailed derivation process of the closed - form solution of EI proposed by Hutter et al. (2009). - This helps practitioners verify their implementations and provides a reference for further improving this acquisition function in the future. ### Formula Summary - **Standard Normal Distribution Function**: \(\Phi(z)=\int_{-\infty}^{z}\frac{1}{\sqrt{2\pi}}e^{-\frac{x^{2}}{2}}dx\) - **Probability Density Function**: \(\phi(z)=\frac{1}{\sqrt{2\pi}}e^{-\frac{z^{2}}{2}}\) #### Closed - form Solution of Traditional EI \[ \text{EI}(\mathbf{x}|D)=(\max_{i = 1}^{n}y_{i}-\mu)\Phi(\alpha)+\sigma\phi(\alpha) \] where: \[ \alpha=\frac{\max_{i = 1}^{n}y_{i}-\mu}{\sigma} \] #### Closed - form Solution of Log - Transformed EI \[ \text{logEI}(\mathbf{x}|D)=y_{\text{best}}\Phi(\alpha)-e^{\mu+\frac{\sigma^{2}}{2}}\Phi(\alpha-\sigma) \] where: \[ \alpha=\frac{\log(y_{\text{best}})-\mu}{\sigma} \] Through these formulas, the author detailedly derived the expression of log - transformed EI, filling in the intermediate steps not provided by Hutter et al. (2009). This not only helps to understand the effectiveness of this method but also provides theoretical support for practical applications.

Derivation of Closed Form of Expected Improvement for Gaussian Process Trained on Log-Transformed Objective

Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit Optimization

Unexpected Improvements to Expected Improvement for Bayesian Optimization

A hierarchical expected improvement method for Bayesian optimization

Active Design of Dynamic GP Models for Model Predictive Control Using Expected Improvement

Improving the Expected Improvement Algorithm

Probability Distribution of Hypervolume Improvement in Bi-objective Bayesian Optimization

Faster Computation of Expected Hypervolume Improvement

Exploiting gradients and Hessians in Bayesian optimization and Bayesian quadrature

Global Optimization of Gaussian Process Acquisition Functions Using a Piecewise-Linear Kernel Approximation

A Solution to the Ill-Conditioning of Gradient-Enhanced Covariance Matrices for Gaussian Processes

Differentiable Expected Hypervolume Improvement for Parallel Multi-Objective Bayesian Optimization

Global Optimization of Gaussian processes

Scaling Gaussian Process Regression with Derivatives

Distributionally Ambiguous Optimization Techniques for Batch Bayesian Optimization

Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization

Warm Start Marginal Likelihood Optimisation for Iterative Gaussian Processes

Improving Hyperparameter Learning under Approximate Inference in Gaussian Process Models

Scalable Bayesian Optimization Using Vecchia Approximations of Gaussian Processes

Regret Optimality of GP-UCB

Relative Expected Improvement in Kriging Based Optimization