Mesh-Informed Neural Networks for Operator Learning in Finite Element Spaces

Nicola Rares Franco,Andrea Manzoni,Paolo Zunino
DOI: https://doi.org/10.48550/arXiv.2203.11648
2023-05-05
Abstract:Thanks to their universal approximation properties and new efficient training strategies, Deep Neural Networks are becoming a valuable tool for the approximation of mathematical operators. In the present work, we introduce Mesh-Informed Neural Networks (MINNs), a class of architectures specifically tailored to handle mesh based functional data, and thus of particular interest for reduced order modeling of parametrized Partial Differential Equations (PDEs). The driving idea behind MINNs is to embed hidden layers into discrete functional spaces of increasing complexity, obtained through a sequence of meshes defined over the underlying spatial domain. The approach leads to a natural pruning strategy which enables the design of sparse architectures that are able to learn general nonlinear operators. We assess this strategy through an extensive set of numerical experiments, ranging from nonlocal operators to nonlinear diffusion PDEs, where MINNs are compared against more traditional architectures, such as classical fully connected Deep Neural Networks, but also more recent ones, such as DeepONets and Fourier Neural Operators. Our results show that MINNs can handle functional data defined on general domains of any shape, while ensuring reduced training times, lower computational costs, and better generalization capabilities, thus making MINNs very well-suited for demanding applications such as Reduced Order Modeling and Uncertainty Quantification for PDEs.
Numerical Analysis
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to develop a new neural network architecture - Mesh - Informed Neural Networks (MINNs) to efficiently process grid - based functional data, especially for reduced - order modeling of parameterized partial differential equations (PDEs). Specifically, the paper aims to solve the problem through the following points: 1. **Handling nonlinear operators in high - dimensional spaces**: Traditional fully - connected deep neural networks have problems such as long training time, high computational cost, and easy over - fitting when handling nonlinear operators in high - dimensional spaces. MINNs overcome these problems by introducing a sparse architecture, enabling them to learn these operators more effectively. 2. **Adapting to domains of arbitrary shapes**: MINNs can process functional data on domains of arbitrary shapes, not just hypercubes or graph structures. This makes MINNs more flexible and effective in dealing with problems of complex geometric shapes. 3. **Improving generalization ability**: By embedding hidden layers into discrete functional spaces, MINNs can learn general nonlinear operators and have better generalization ability. This is especially important in multi - query applications (such as optimal control and statistical inference), because these applications require frequent calls to expensive numerical solvers. 4. **Reducing training time and computational cost**: MINNs reduce the degrees of freedom of the model by using a prior pruning strategy, thereby reducing training time and computational cost. This strategy utilizes the geometric knowledge from the underlying space to make the model more efficient. 5. **Comparison with existing methods**: Through extensive numerical experiments, the paper compares MINNs with traditional fully - connected deep neural networks and some recent methods (such as DeepONets and Fourier Neural Operators), showing the advantages of MINNs in handling non - local operators and nonlinear diffusion PDEs, etc. In summary, the main objective of this paper is to solve the problems existing in traditional deep neural networks when handling nonlinear operators in high - dimensional spaces by proposing MINNs, and to provide a more efficient, more flexible, and more generalization - capable solution.