Rui Zhang,Qi Meng,Rongchan Zhu,Yue Wang,Wenlei Shi,Shihua Zhang,Zhi-Ming Ma,Tie-Yan Liu
Abstract:In scenarios with limited available data, training the function-to-function neural PDE solver in an unsupervised manner is essential. However, the efficiency and accuracy of existing methods are constrained by the properties of numerical algorithms, such as finite difference and pseudo-spectral methods, integrated during the training stage. These methods necessitate careful spatiotemporal discretization to achieve reasonable accuracy, leading to significant computational challenges and inaccurate simulations, particularly in cases with substantial spatiotemporal variations. To address these limitations, we propose the Monte Carlo Neural PDE Solver (MCNP Solver) for training unsupervised neural solvers via the PDEs' probabilistic representation, which regards macroscopic phenomena as ensembles of random particles. Compared to other unsupervised methods, MCNP Solver naturally inherits the advantages of the Monte Carlo method, which is robust against spatiotemporal variations and can tolerate coarse step size. In simulating the trajectories of particles, we employ Heun's method for the convection process and calculate the expectation via the probability density function of neighbouring grid points during the diffusion process. These techniques enhance accuracy and circumvent the computational issues associated with Monte Carlo sampling. Our numerical experiments on convection-diffusion, Allen-Cahn, and Navier-Stokes equations demonstrate significant improvements in accuracy and efficiency compared to other unsupervised baselines. The source code will be publicly available at:
What problem does this paper attempt to address?
### Problems the paper attempts to solve
This paper aims to solve the problem of training function - to - function neural partial differential equation (PDE) solvers in an unsupervised manner with limited data. Specifically, existing methods integrate numerical algorithms (such as the finite - difference method and the pseudo - spectral method) in the training stage. These methods require fine spatio - temporal discretization to achieve reasonable accuracy, resulting in significant computational challenges and inaccurate simulations, especially in cases with large spatio - temporal variations.
To overcome these limitations, the authors propose the Monte Carlo Neural PDE Solver (MCNP Solver). This method trains an unsupervised neural solver through the probabilistic representation of PDEs, regarding macroscopic phenomena as collections of random particles. Compared with other unsupervised methods, the MCNP Solver naturally inherits the advantages of the Monte Carlo method, that is, it is robust to spatio - temporal variations and can tolerate coarser step sizes. In addition, this method uses the Heun method to handle the convection process when simulating particle trajectories and calculates the expected value of the diffusion process through the probability density function (PDF) of neighboring grid points, thereby improving accuracy and avoiding the computational problems of Monte Carlo sampling.
### Main contributions
1. **Propose the MCNP Solver**: An innovative unsupervised method for training neural solvers applicable to PDE systems with probabilistic representations. At the same time, multiple strategies are designed to improve the performance and efficiency of convection and diffusion processes in SDE simulations.
2. **Experimental verification**: Through experiments on the convection - diffusion equation, the Allen - Cahn equation, and the Navier - Stokes equation, the significant improvement of the MCNP Solver in handling complex spatio - temporal variations and coarse - step - size tasks is demonstrated. In addition, experiments on the 2D fractional - order diffusion equation are also carried out, expanding the application of the MCNP Solver in mesh - free and fractional Laplacian scenarios.
3. **Comprehensive comparison**: Not only is a comparison made with unsupervised learning methods, but also a comprehensive comparison is made with widely used PDE solvers (such as the classical Euler method, the Monte Carlo method, and supervised training methods), and the advantages, disadvantages, and application scopes of each method are discussed in detail.
### Method overview
#### 3.1 The Monte Carlo method and its theoretical basis
The paper considers the general convection - diffusion equation:
\[ \frac{\partial u}{\partial t}=\beta[u](x, t)\cdot\nabla u+\kappa\Delta u + f(x, t), \]
\[ u(x, 0)=u_0(x), \]
where \( x\in\Omega\subset\mathbb{R}^d \) and \( t \) represent the \( d \)-dimensional spatial variable and the time variable respectively, \( \beta[u](x, t)\in\mathbb{R}^d \) is a vector - valued mapping from \( (u, x, t) \) to \( \mathbb{R}^d \), \( \kappa\in\mathbb{R}^+ \) is the diffusion parameter, and \( f(x, t)\in\mathbb{R} \) represents the force term.
The Feynman - Kac formula provides the relationship between PDEs and their probabilistic representations. Through time reversal, we can obtain:
\[ \frac{\partial\tilde{u}}{\partial t}=-\beta[\tilde{u}](x, t)\cdot\nabla\tilde{u}-\kappa\Delta\tilde{u}-\tilde{f}(x, t), \]
\[ \tilde{u}(x, T)=u_0(x). \]
Applying the Feynman - Kac formula, we can obtain:
\[ \tilde{u}_0(x)=\mathbb{E}_{\tilde{\xi}}\left[\tilde{u}_T(\tilde{\xi}_T)+\int_0^T\tilde{f}(\tilde{\xi}_s, s)\,ds\right]