Abstract:This paper introduces PDEformer, a neural solver for partial differential equations (PDEs) capable of simultaneously addressing various types of PDEs. We propose to represent the PDE in the form of a computational graph, facilitating the seamless integration of both symbolic and numerical information inherent in a PDE. A graph Transformer and an implicit neural representation (INR) are employed to generate mesh-free predicted solutions. Following pretraining on data exhibiting a certain level of diversity, our model achieves zero-shot accuracies on benchmark datasets that is comparable to those of specifically trained expert models. Additionally, PDEformer demonstrates promising results in the inverse problem of PDE coefficient recovery.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to develop a fundamental model capable of efficiently handling various partial differential equations (PDEs). Specifically, the paper introduces a neural solver named PDEformer, aiming to solve multiple types of PDEs simultaneously. PDEformer seamlessly integrates the symbolic and numerical information in PDEs by representing PDEs in the form of a computational graph. It uses Graph Transformer and Implicit Neural Representation (INR) to generate mesh - free predicted solutions. After pre - training on data with a certain degree of diversity, the zero - sample accuracy of PDEformer on the benchmark dataset is comparable to that of expert models trained specifically. In addition, PDEformer also shows promising results in the inverse problem of PDE coefficient recovery. ### Main Contributions 1. **Generality**: PDEformer aims to construct a fundamental PDE model with the highest generality, which can ideally handle any PDE. For a new PDE, this model can be directly used for zero - sample inference, or fine - tuned with a small number of solution snapshots. 2. **Computational Graph Representation**: Represent the symbolic form of PDE as a computational graph, ensuring that the graph structure, its node types, and feature vectors encapsulate all the symbolic and numerical information required to solve the PDE. 3. **High Performance**: The pre - trained PDEformer shows higher zero - sample prediction accuracy on the benchmark dataset, and its performance can be further improved after fine - tuning. 4. **Inverse Problem Application**: PDEformer performs well in the inverse problem of PDE coefficient recovery, verifying its potential application value in downstream tasks. ### Method Overview The method of PDEformer mainly consists of the following steps: 1. **Computational Graph Construction**: Represent the symbolic information of PDE as a computational graph, where nodes can represent unknown field variables, scalar coefficients, initial conditions, differential operations, etc., and edges represent operands in operations. 2. **Graph Data Encoding**: Use Graph Transformer to integrate the symbolic and numerical information in the computational graph into a latent code. 3. **Decoding PDE Solutions**: Use INR to generate mesh - free predicted solutions according to coordinate inputs. ### Experimental Results 1. **Pre - training Stage**: Generate a dataset containing 500,000 samples, covering different types of PDEs, coefficients, and initial conditions. After pre - training, the relative L2 error of PDEformer on the training set is 0.0104, and on the test set is 0.0128. 2. **Forward Problem**: Evaluate the forward problem performance of PDEformer on the PDEBench dataset, including Burgers, Advection, and 1D Reaction - Diffusion PDEs. The results show that PDEformer performs excellently in zero - sample inference, even surpassing the baseline models trained specifically for these datasets. 3. **Inverse Problem**: Use PDEformer to conduct the inverse problem of PDE coefficient recovery. Even at a high noise level, it can effectively recover most PDE coefficients. ### Conclusion By representing PDEs as computational graphs and combining Graph Transformer and INR, PDEformer successfully constructs a PDE solver with strong generality and high performance. This model not only performs well in forward problems but also shows strong application potential in inverse problems. Although the current experiments are limited to one - dimensional PDEs, this achievement lays an important foundation for constructing more widely applicable fundamental PDE models.

PDEformer: Towards a Foundation Model for One-Dimensional Partial Differential Equations

PDEformer-1: A Foundation Model for One-Dimensional Partial Differential Equations

Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations

NeuralPDE: Modelling Dynamical Systems from Data

Convolution-Based Model-Solving Method for Three-Dimensional, Unsteady, Partial Differential Equations

An improved data-free surrogate model for solving partial differential equations using deep neural networks

Solving Partial Differential Equations Using Point-Based Neural Networks.

PDE-Net: Learning PDEs from Data

HAMLET: Graph Transformer Neural Operator for Partial Differential Equations

Numerical solution for high-dimensional partial differential equations based on deep learning with residual learning and data-driven learning

A Neural RDE-based model for solving path-dependent PDEs

PDE-Net 2.0: Learning PDEs from Data with A Numeric-Symbolic Hybrid Deep Network

Diffeomorphic Latent Neural Operators for Data-Efficient Learning of Solutions to Partial Differential Equations

Solving Partial Differential Equations Using Deep Learning and Physical Constraints

Discovering Physics-Informed Neural Networks Model for Solving Partial Differential Equations through Evolutionary Computation

Neural Operator: Graph Kernel Network for Partial Differential Equations

Towards a Foundation Model for Partial Differential Equations: Multi-Operator Learning and Extrapolation

Quantifying Training Difficulty and Accelerating Convergence in Neural Network-Based PDE Solvers

Graph Neural PDE Solvers with Conservation and Similarity-Equivariance

An Axiomatized PDE Model of Deep Neural Networks.

D3M: A deep domain decomposition method for partial differential equations