Abstract:Gaussian process regression is widely applied in computational science and engineering for surrogate modeling owning to its kernel-based and probabilistic nature. In this work, we propose a Bayesian approach that integrates the variability of input data into the Gaussian process regression for function and partial differential equation approximation. Leveraging two types of observables -- noise-corrupted outputs with certain inputs and those with prior-distribution-defined uncertain inputs, a posterior distribution of uncertain inputs is estimated via Bayesian inference. Thereafter, such quantified uncertainties of inputs are incorporated into Gaussian process predictions by means of marginalization. The setting of two types of data aligned with common scenarios of constructing surrogate models for the solutions of partial differential equations, where the data of boundary conditions and initial conditions are typically known while the data of solution may involve uncertainties due to the measurement or stochasticity. The effectiveness of the proposed method is demonstrated through several numerical examples including multiple one-dimensional functions, the heat equation and Allen-Cahn equation. A consistently good performance of generalization is observed, and a substantial reduction in the predictive uncertainties is achieved by the Bayesian inference of uncertain inputs.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: when constructing surrogate models of partial differential equations (PDEs), how to deal with the uncertainty of input data positions. Specifically, the paper proposes a Bayesian method to integrate the uncertainty of input data positions into Gaussian process regression in order to improve prediction performance and reduce the uncertainty in prediction. ### Problem Background In many scientific and engineering applications, it is very common to use data - driven methods to approximate the solutions of ordinary differential equations (ODEs) or partial differential equations (PDEs). However, these methods usually assume that the positions of input data are determined, while in actual situations, the positions of input data may be affected by factors such as measurement errors and environmental disturbances, and there is uncertainty. This uncertainty will affect the accuracy and generalization ability of the model. ### Core Problems of the Paper 1. **Uncertainty of Input Data Positions**: The paper focuses on how to deal with the uncertainty of input data positions in Gaussian process regression. Traditional methods usually only consider the noise of output data and ignore the uncertainty of input data positions. 2. **Application of Bayesian Inference**: In order to deal with the uncertainty of input data positions, the paper proposes a Bayesian method. Through Bayesian inference, the posterior distribution of uncertain input positions is estimated and incorporated into Gaussian process regression. 3. **Construction of PDE Surrogate Models**: The paper pays special attention to how to apply this method to the construction of PDE surrogate models. The boundary conditions and initial conditions of PDEs are usually known, but the solution data may contain uncertainty, so an effective method is needed to deal with these uncertainties. ### Solutions The solutions proposed in the paper include the following aspects: - **Bayesian Inference**: Estimate the posterior distribution of uncertain input positions through Bayesian inference. Specifically, the paper utilizes two types of data: noise - contaminated output data with determined input positions and data with uncertain input positions defined by prior distributions. - **Marginalization**: Integrate the estimated posterior distribution of uncertain input positions into Gaussian process regression through marginalization, thereby reducing the uncertainty in prediction. - **Numerical Experiment Verification**: Through numerical experiments such as multiple one - dimensional functions, the heat conduction equation, and the Allen - Cahn equation, the effectiveness of the proposed method is verified. The results show that this method can significantly reduce the uncertainty in prediction and improve the generalization ability of the model. ### Formula Representation The key formulas involved in the paper include: - **Joint Distribution of Gaussian Process Regression**: \[ \begin{bmatrix} y \\ f(x^*) \end{bmatrix} \mid X, \theta, \sigma_n^2 \sim \mathcal{N} \left(0, \begin{bmatrix} k(X,X;\theta) + \sigma_n^2 I & k(X,x^*;\theta) \\ k(x^*,X;\theta) & k(x^*,x^*;\theta) \end{bmatrix} \right) \] - **Posterior Distribution of Bayesian Inference**: \[ p(X_u \mid X_c, y_u, y_c, \theta, \sigma_n^2) \propto p(X_u) p(y_u, y_c \mid X_u, X_c, \theta, \sigma_n^2) \] - **Marginal Prediction Distribution**: \[ p(f(x^*) \mid X_c, y_c, y_u, \tilde{\theta}, \tilde{\sigma}_n^2) = \int p(f(x^*) \mid X_c, X_u, y_c, y_u, \tilde{\theta}, \tilde{\sigma}_n^2) p(X_u \mid X_c, y_u, y_c, \tilde{\theta}, \tilde{\sigma}_n^2) dX_u \] Through these methods, the paper is successful.

PDE-constrained Gaussian process surrogate modeling with uncertain data locations

A Surrogate Modelling Approach Based on Nonlinear Dimension Reduction for Uncertainty Quantification in Groundwater Flow Models

An Adaptive Gaussian Process‐based Method for Efficient Bayesian Experimental Design in Groundwater Contaminant Source Identification Problems

Gaussian process regression + deep neural network autoencoder for probabilistic surrogate modeling in nonlinear mechanics of solids

Bayesian learning with Gaussian processes for low-dimensional representations of time-dependent nonlinear systems

Conformal Approach To Gaussian Process Surrogate Evaluation With Coverage Guarantees

Surrogate-Based Inverse Modeling of the Hydrological System: An Adaptive Approach Considering Surrogate Structural Error.

Physics-Informed Gaussian Process Regression Generalizes Linear PDE Solvers

Uncertainty Quantification of Surrogate Models using Conformal Prediction

Bayesian Surrogate Analysis and Uncertainty Propagation

Variational Bayesian surrogate modelling with application to robust design optimisation

A two-step surrogate method for sequential uncertainty quantification in high-dimensional inverse problems

Evaluating Two Sparse Grid Surrogates and Two Adaptation Criteria for Groundwater Bayesian Uncertainty Quantification

A hybrid data driven-physics constrained Gaussian process regression framework with deep kernel for uncertainty quantification

Introduction To Gaussian Process Regression In Bayesian Inverse Problems, With New ResultsOn Experimental Design For Weighted Error Measures

Fully Bayesian Differential Gaussian Processes through Stochastic Differential Equations

Total Uncertainty Quantification in Inverse PDE Solutions Obtained with Reduced-Order Deep Learning Surrogate Models

An adaptive Gaussian process method for multi-modal Bayesian inverse problems

On the Inference of Applying Gaussian Process Modeling to a Deterministic Function

Bayesian-entropy Gaussian Process for Constrained Metamodeling.

Data-driven Modeling and Inference for Bayesian Gaussian Process ODEs via Double Normalizing Flows