Bias-Variance Trade-off in Physics-Informed Neural Networks with Randomized Smoothing for High-Dimensional PDEs

Zheyuan Hu, Zhouhao Yang, Yezhen Wang, George Em Karniadakis, Kenji Kawaguchi
2023-11-27
Abstract:While physics-informed neural networks (PINNs) have been proven effective for low-dimensional partial differential equations (PDEs), the computational cost remains a hurdle in high-dimensional scenarios. This is particularly pronounced when computing high-order and high-dimensional derivatives in the physics-informed loss. Randomized Smoothing PINN (RS-PINN) introduces Gaussian noise for stochastic smoothing of the original neural net model, enabling Monte Carlo methods for derivative approximation, eliminating the need for costly auto-differentiation. Despite its computational efficiency in high dimensions, RS-PINN introduces biases in both loss and gradients, negatively impacting convergence, especially when coupled with stochastic gradient descent (SGD). We present a comprehensive analysis of biases in RS-PINN, attributing them to the nonlinearity of the Mean Squared Error (MSE) loss and the PDE nonlinearity. We propose tailored bias correction techniques based on the order of PDE nonlinearity. The unbiased RS-PINN allows for a detailed examination of its pros and cons compared to the biased version. Specifically, the biased version has a lower variance and runs faster than the unbiased version, but it is less accurate due to the bias. To optimize the bias-variance trade-off, we combine the two approaches in a hybrid method that balances the rapid convergence of the biased version with the high accuracy of the unbiased version. In addition, we present an enhanced implementation of RS-PINN. Extensive experiments on diverse high-dimensional PDEs, including Fokker-Planck, HJB, viscous Burgers', Allen-Cahn, and Sine-Gordon equations, illustrate the bias-variance trade-off and highlight the effectiveness of the hybrid RS-PINN. Empirical guidelines are provided for selecting biased, unbiased, or hybrid versions, depending on the dimensionality and nonlinearity of the specific PDE problem.
Machine Learning,Artificial Intelligence,Dynamical Systems,Numerical Analysis
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper mainly explores how to optimize the physics - informed neural network based on stochastic smoothing (RS - PINN) in high - dimensional partial differential equations (PDEs). Specifically, the authors focus on: 1. **The problem of high computational cost**: When dealing with high - dimensional PDEs, traditional PINNs require the calculation of high - order and high - dimensional derivatives, resulting in very high computational costs. In particular, when calculating the physics - informed loss, the cost of automatic differentiation is very large. 2. **The problem of introducing bias**: RS - PINN performs stochastic smoothing by introducing Gaussian noise and uses the Monte Carlo method to approximate derivatives, thus avoiding expensive automatic differentiation. However, this method introduces biases in the loss and gradient, especially when combined with the stochastic gradient descent (SGD) algorithm, and these biases will seriously affect the convergence of the model. 3. **Bias - variance trade - off**: To balance bias and variance, the authors propose a bias - correction technique and analyze the advantages and disadvantages of the biased and unbiased versions respectively. The biased version runs faster but has lower accuracy; the unbiased version is more accurate but has larger variance and runs more slowly. ### Solutions To solve the above problems, the authors take the following measures: - **Analysis of bias sources**: Analyze in detail the sources of bias in RS - PINN, including the non - linearity of the mean - squared - error (MSE) loss function and the non - linearity of the PDE itself. - **Bias - correction technique**: Propose a bias - correction method for PDEs with different non - linearity orders. For example, for the second - order non - linear HJB equation, resample using four independent Gaussian samples to eliminate bias. - **Hybrid method**: Propose a hybrid method that combines the advantages of the biased and unbiased versions. In the initial stage, use the biased version to quickly converge to a better point, and then switch to the unbiased version for fine - tuning to obtain higher accuracy. ### Experimental verification Through extensive experiments, the authors verify the effectiveness of the hybrid method. The experiments cover a variety of high - dimensional PDEs, such as the Fokker - Planck equation, the Hamilton - Jacobi - Bellman equation, the viscous Burgers equation, the Allen - Cahn equation, and the Sine - Gordon equation. The experimental results show that the hybrid method can achieve the best performance in different dimensions and non - linear situations. ### Summary In general, this paper aims to solve the bias problem introduced by RS - PINN when dealing with high - dimensional PDEs, and proposes an effective hybrid method to optimize the bias - variance trade - off, thereby improving the convergence speed and accuracy of the model.