Deep Neural Network Modeling of Unknown Partial Differential Equations in Nodal Space

Zhen Chen,Victor Churchill,Kailiang Wu,Dongbin Xiu
DOI: https://doi.org/10.1016/j.jcp.2021.110782
2021-06-07
Abstract:We present a numerical framework for deep neural network (DNN) modeling of unknown time-dependent partial differential equations (PDE) using their trajectory data. Unlike the recent work of [Wu and Xiu, J. Comput. Phys. 2020], where the learning takes place in modal/Fourier space, the current method conducts the learning and modeling in physical space and uses measurement data as nodal values. We present a DNN structure that has a direct correspondence to the evolution operator of the underlying PDE, thus establishing the existence of the DNN model. The DNN model also does not require any geometric information of the data nodes. Consequently, a trained DNN defines a predictive model for the underlying unknown PDE over structureless grids. A set of examples, including linear and nonlinear scalar PDE, system of PDEs, in both one dimension and two dimensions, over structured and unstructured grids, are presented to demonstrate the effectiveness of the proposed DNN modeling. Extension to other equations such as differential-integral equations is also discussed.
Machine Learning,Numerical Analysis
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to use deep neural networks (DNN) to model unknown time - dependent partial differential equations (PDEs). Specifically, the author proposes a numerical framework that uses trajectory data to learn and model in physical space, rather than in modal (Fourier) space as in most previous studies. The key to this method lies in constructing a DNN structure that directly corresponds to the evolution operator of the unknown PDE, so that it can perform predictive modeling of the unknown PDE on structured or unstructured grids. ### Main contributions 1. **General DNN framework**: The paper proposes a general DNN framework for modeling various types of unknown PDEs, rather than just identifying specific terms in the PDE. 2. **Learning directly in physical space**: Different from the previous methods of learning in modal space, the method in this paper directly uses measurement data as node values to learn in physical space without any geometric information. 3. **New DNN structure**: A new DNN structure is proposed, including disassembly blocks and assembly layers, which can directly correspond to the general time - step evolution of the unknown PDE. 4. **Mesh - free prediction model**: This method can work on arbitrarily distributed grids, forming a mesh - free prediction model, which is suitable for the situation where the positions of data points are more random in practical problems. ### Method overview 1. **Problem setting**: Consider an autonomous time - dependent PDE: \[ \begin{cases} u_t = L(u), & (x, t) \in \Omega\times\mathbb{R}^+ \\ B(u) = 0, & (x, t) \in \partial\Omega\times\mathbb{R}^+ \\ u(x, 0) = u_0(x), & x\in\bar{\Omega} \end{cases} \] where \(\Omega\subset\mathbb{R}^d\), \(L\) and \(B\) represent the PDE operator and the boundary condition operator respectively, and it is assumed that the PDE is unknown. 2. **Data set**: Assume that there is data of the state variable \(u\) on a set of nodes \(X_N=\{x_1,\ldots,x_N\}\subset\Omega\), and these data are available at some discrete time points, forming so - called snapshots: \[ u(t_j^{(k)}),\quad j = 1,\ldots,\ell^{(k)},\quad k = 1,\ldots,N_{\text{traj}} \] These snapshots are paired into data pairs of consecutive time points: \[ (u(t_j^{(k)}), u(t_{j + 1}^{(k)})),\quad j = 1,\ldots,\ell^{(k)}- 1,\quad k = 1,\ldots,N_{\text{traj}} \] 3. **DNN structure**: The proposed DNN structure includes: - **Input layer**: The number of neurons is \(N\), that is, the dimension of \(u\). - **Disassembly block**: It contains \(J\geq1\) fully - connected feed - forward neural networks (FNNs), each FNN has a width of \(n_w\) and a depth of \(n_d\), and receives input from the input layer. - **Assembly layer**: A standard fully - connected FNN with a width of \(J\) and a depth of \(n_a\), operating along the thickness direction of the disassembly block. - **Output layer**: The number of neurons is \(N\), that is, the dimension of \(u\), and the input layer is re - introduced in the way of a residual network (ResNet). 4. **Model training**: Use the mean - square loss function to minimize and train the DNN model: \[ L(\Theta; \Delta t)=