Derivation of Closed Form of Expected Improvement for Gaussian Process Trained on Log-Transformed Objective

Shuhei Watanabe
2024-11-27
Abstract:Expected Improvement (EI) is arguably the most widely used acquisition function in Bayesian optimization. However, it is often challenging to enhance the performance with EI due to its sensitivity to numerical precision. Previously, Hutter et al. (2009) tackled this problem by using Gaussian process trained on the log-transformed objective function and it was reported that this trick improves the predictive accuracy of GP, leading to substantially better performance. Although Hutter et al. (2009) offered the closed form of their EI, its intermediate derivation has not been provided so far. In this paper, we give a friendly derivation of their proposition.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to provide the closed - form solution derivation of Expected Improvement (EI), especially for Gaussian Process (GP) trained with a log - transformed objective function. Specifically: 1. **Problem Background**: - Expected Improvement (EI) is one of the most commonly used acquisition functions in Bayesian optimization. - However, due to its sensitivity to numerical precision, improving the performance of EI is often challenging. - Hutter et al. (2009) solved this problem by training GP with a log - transformed objective function and reported that this method significantly improved the prediction accuracy of GP, thus enhancing the performance. 2. **Unsolved Problems**: - Although Hutter et al. (2009) provided the closed - form solution of EI, its derivation process was not given in detail. 3. **Main Contributions of the Paper**: - The author of this paper provided the detailed derivation process of the closed - form solution of EI proposed by Hutter et al. (2009). - This helps practitioners verify their implementations and provides a reference for further improving this acquisition function in the future. ### Formula Summary - **Standard Normal Distribution Function**: \(\Phi(z)=\int_{-\infty}^{z}\frac{1}{\sqrt{2\pi}}e^{-\frac{x^{2}}{2}}dx\) - **Probability Density Function**: \(\phi(z)=\frac{1}{\sqrt{2\pi}}e^{-\frac{z^{2}}{2}}\) #### Closed - form Solution of Traditional EI \[ \text{EI}(\mathbf{x}|D)=(\max_{i = 1}^{n}y_{i}-\mu)\Phi(\alpha)+\sigma\phi(\alpha) \] where: \[ \alpha=\frac{\max_{i = 1}^{n}y_{i}-\mu}{\sigma} \] #### Closed - form Solution of Log - Transformed EI \[ \text{logEI}(\mathbf{x}|D)=y_{\text{best}}\Phi(\alpha)-e^{\mu+\frac{\sigma^{2}}{2}}\Phi(\alpha-\sigma) \] where: \[ \alpha=\frac{\log(y_{\text{best}})-\mu}{\sigma} \] Through these formulas, the author detailedly derived the expression of log - transformed EI, filling in the intermediate steps not provided by Hutter et al. (2009). This not only helps to understand the effectiveness of this method but also provides theoretical support for practical applications.