Partitioned Surrogates and Thompson Sampling for Multidisciplinary Bayesian Optimization

Susanna Baars, Jigar Parekh, Ihar Antonau, Philipp Bekemeyer, Ulrich Römer
2024-08-16
Abstract: The long runtime associated with simulating multidisciplinary systems challenges the use of Bayesian optimization for multidisciplinary design optimization (MDO). This is particularly the case if the coupled system is modeled in a partitioned manner and feedback loops, known as strong coupling, are present. This work introduces a method for Bayesian optimization in MDO called "Multidisciplinary Design Optimization using Thompson Sampling", abbreviated as MDO-TS. Instead of replacing the whole system with a surrogate, we substitute each discipline with such a Gaussian process. Since an entire multidisciplinary analysis is no longer required for enrichment, evaluations can potentially be saved. However, the objective and associated uncertainty are no longer analytically estimated. Since most adaptive sampling strategies assume the availability of these estimates, they cannot be applied without modification. Thompson sampling does not require this explicit availability. Instead, Thompson sampling balances exploration and exploitation by selecting actions based on optimizing random samples from the objective. We combine Thompson sampling with an approximate sampling strategy that uses random Fourier features. This approach produces continuous functions that can be evaluated iteratively. We study the application of this infill criterion to both an analytical problem and the shape optimization of a simple fluid-structure interaction example.
Computational Engineering, Finance, and Science
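The sampling idea described in the abstract (Thompson sampling on approximate GP sample paths built from random Fourier features) can be illustrated with a minimal Python sketch. It is not the authors' implementation: it uses a single GP on a toy 1-D objective rather than partitioned discipline surrogates, assumes a squared-exponential kernel with fixed hyperparameters, and assumes the standard pathwise update for the data-dependent weights. All function names, the toy objective, and the parameter values are hypothetical.

```python
import numpy as np

def rbf_kernel(A, B, lengthscale=0.3, sigma=1.0):
    """Squared-exponential kernel between point sets A (m, d) and B (n, d)."""
    sq_dist = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
    return sigma**2 * np.exp(-0.5 * sq_dist / lengthscale**2)

def draw_posterior_path(X, y, n_features=500, lengthscale=0.3, sigma=1.0,
                        noise_std=1e-2, rng=None):
    """Return one approximate posterior sample path as a callable function."""
    rng = np.random.default_rng() if rng is None else rng
    dim = X.shape[1]
    # Random Fourier features of the squared-exponential kernel (assumed kernel):
    # frequencies from its spectral density, phases uniform on [0, 2*pi).
    theta = rng.normal(scale=1.0 / lengthscale, size=(n_features, dim))
    tau = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    w = rng.normal(size=n_features)  # weights of the prior sample

    def features(Z):
        return sigma * np.sqrt(2.0 / n_features) * np.cos(Z @ theta.T + tau)

    # Pathwise update (assumed form): correct the prior sample towards the data.
    prior_at_X = features(X) @ w
    eps = rng.normal(scale=noise_std, size=len(y))
    K = rbf_kernel(X, X, lengthscale, sigma)
    v = np.linalg.solve(K + noise_std**2 * np.eye(len(y)), y - prior_at_X - eps)

    def path(Z):
        # Z must have shape (m, d); returns the sample path evaluated at Z.
        return features(Z) @ w + rbf_kernel(Z, X, lengthscale, sigma) @ v

    return path

# Toy data: five evaluations of a hypothetical 1-D objective on [-1, 1].
rng = np.random.default_rng(0)
X = np.linspace(-1.0, 1.0, 5)[:, None]
y = np.sin(3.0 * X[:, 0]) + X[:, 0] ** 2

# One Thompson sampling step: minimize a single sample path over a candidate grid.
path = draw_posterior_path(X, y, rng=rng)
candidates = np.linspace(-1.0, 1.0, 201)[:, None]
x_next = candidates[np.argmin(path(candidates))]
```

Because the returned `path` is an ordinary function, it can be evaluated at arbitrary points and as often as needed. The grid search is only a stand-in for whatever optimizer is applied to the sample.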
### What problems does this paper attempt to solve?

This paper addresses the computational cost and complexity of Multidisciplinary Design Optimization (MDO). Specifically, it targets the following challenges:

1. **Long runtime**: Simulating multidisciplinary systems is computationally expensive, especially when the system is modeled in a partitioned manner and strong coupling (i.e., feedback loops) is present. This makes Bayesian optimization difficult to apply to MDO.
2. **Limitations of traditional methods**: Traditional MDO methods usually rely on exact gradients (for example, computed by the adjoint method), which are hard to obtain for complex coupled systems. Current efforts therefore focus on making gradient-free methods such as Bayesian optimization more efficient for MDO.
3. **Balance between exploration and exploitation**: Bayesian optimization must balance exploration and exploitation. Most adaptive sampling strategies assume that the objective estimate and its uncertainty are available analytically, which is no longer the case when each discipline has its own surrogate.

To address these challenges, the paper introduces a Bayesian optimization method called "Multidisciplinary Design Optimization using Thompson Sampling" (abbreviated as MDO-TS). Its main features are:

- **Partitioned surrogate models**: Each discipline is replaced by its own Gaussian process (GP) surrogate instead of replacing the whole system with a single surrogate. Enrichment then no longer requires an entire multidisciplinary analysis, so evaluations can potentially be saved.
- **Thompson Sampling**: Thompson sampling (TS) balances exploration and exploitation by optimizing random samples drawn from the posterior of the objective, so it does not require explicit uncertainty estimates.
- **Approximate sampling scheme**: TS is combined with an approximate sampling scheme based on Random Fourier Features (RFF), which produces continuous sample functions that can be evaluated iteratively.

With these changes, MDO-TS aims to reduce the computational cost of Bayesian optimization for strongly coupled multidisciplinary systems.

### Formula summary

- **Gaussian process regression**:
  - Posterior mean: \[ m_n(x^*) = k(x^*, X) \left( K + \sigma^2_\epsilon I \right)^{-1} y \]
  - Posterior covariance: \[ k_n(x^*, x^*) = k(x^*, x^*) - k(x^*, X) \left( K + \sigma^2_\epsilon I \right)^{-1} k(X, x^*) \]
- **Thompson Sampling**:
  - In each iteration, draw a realization from the conditional GP: \[ \hat{f}_n|y \sim \text{GP}(m_n, k_n) \]
  - Select new control points: \[ x_j = \arg \min_x (\hat{f}_n|y)(x) \]
- **Random Fourier Features**:
  - Approximate path update formula: \[ (\hat{f}_n|y)(\cdot) \approx \sum_{i = 1}^l w_i \phi_i(\cdot) + \sum_{j = 1}^n v_j k(\cdot, x_j) \]
  - where: \[ \phi_i = \sigma \sqrt{\frac{2}{l}} \cos(\theta^T x + \tau_i), \quad \thet