Densely Multiplied Physics Informed Neural Networks

Feilong Jiang,Xiaonan Hou,Min Xia
2024-10-04
Abstract:Although physics-informed neural networks (PINNs) have shown great potential in dealing with nonlinear partial differential equations (PDEs), it is common that PINNs will suffer from the problem of insufficient precision or obtaining incorrect outcomes. Unlike most of the existing solutions trying to enhance the ability of PINN by optimizing the training process, this paper improved the neural network architecture to improve the performance of PINN. We propose a densely multiply PINN (DM-PINN) architecture, which multiplies the output of a hidden layer with the outputs of all the behind hidden layers. Without introducing more trainable parameters, this effective mechanism can significantly improve the accuracy of PINNs. The proposed architecture is evaluated on four benchmark examples (Allan-Cahn equation, Helmholtz equation, Burgers equation and 1D convection equation). Comparisons between the proposed architecture and different PINN structures demonstrate the superior performance of the DM-PINN in both accuracy and efficiency.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that Physics - Informed Neural Networks (PINNs) often face the problems of insufficient accuracy or incorrect results when dealing with nonlinear partial differential equations (PDEs). Specifically, PINNs may encounter the following challenges when solving PDEs: 1. **Unbalanced Residual Distribution**: When calculating the loss function, some points with large residuals may account for only a small part of the entire training data set, resulting in the importance of these points being ignored and thus affecting the accuracy of the final solution. 2. **Space - Time Causality Problem**: Different from traditional numerical calculation methods, PINNs lack the guarantee of causality during the training process. That is, all time and space points are input into the neural network simultaneously, and it cannot be ensured that the problems of early time points are solved first and then the problems of later time points are solved. 3. **Simple Structure**: Traditional PINNs adopt simple fully - connected neural networks (FC - NN), and this structure may have limitations when expressing complex physical phenomena. To solve these problems, the author proposes a new neural network architecture - Densely Multiplied PINN (DM - PINN). By introducing element - level multiplication operations between hidden - layer outputs, DM - PINN can significantly improve the accuracy and performance of PINNs without increasing the number of trainable parameters. In addition, in order to deal with the vanishing gradient problem and further improve the model performance, the author also introduces Batch Normalization layers and ResNet structures (residual blocks), and proposes Skip Densely Multiplied PINN (SDM - PINN). The paper verifies the effectiveness of the proposed method through four benchmark test cases (Allan - Cahn equation, Helmholtz equation, Burgers equation, and 1D convection equation) and compares it with existing PINN structures, demonstrating the superiority of DM - PINN in terms of accuracy and efficiency.