Subspace method based on neural networks for eigenvalue problems

Xiaoying Dai,Yunying Fan,Zhiqiang Sheng
2024-10-17
Abstract:With the rapid development of machine learning, numerical discretization methods based on deep neural networks have been widely used in many fields, especially in solving high-dimensional problems where traditional methods face bottlenecks. However, for low-dimensional problems, existing machine learning methods are not as accurate and efficient as traditional methods. In this paper, we propose a subspace method based on neural networks for eigenvalue problems with high accuracy and low cost. Our basic idea is to use neural network based basis functions to span a subspace, then calculate the parameters of the neural network based basis functions through appropriate training, and finally calculate the Galerkin projection of the eigenvalue problem onto the subspace and obtain an approximate solution. In addition, we reduce the dimension of the subspace by applying some dimensionality reduction technique, which can improve the accuracy of the approximate solution in further. Numerical experiments show that we can obtain approximate eigenvalues with accuracy of $10^{-11}$ but with less then 400 epochs, which is significantly superior to other existing neural network based methods.
Numerical Analysis
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: when solving eigenvalue problems, the existing deep - neural - network - based methods are less accurate and efficient than traditional methods in dealing with low - dimensional problems. Therefore, the author proposes a subspace method based on neural networks (Subspace Method Based on Neural Networks, SNN) to improve the accuracy and efficiency of solving eigenvalue problems. ### Problem Background Eigenvalue problems are of great significance in many fields such as mathematics, physics, and chemistry, for example, in mechanics of materials, fluid mechanics, structural mechanics, and quantum mechanics. With the development of machine learning, numerical discretization methods based on deep neural networks are widely used in solving high - dimensional problems, but in low - dimensional problems, these methods are not as accurate and efficient as traditional methods. ### Paper Objectives To solve this problem, the author proposes a subspace method based on neural networks (SNN), aiming to improve the accuracy and efficiency of solving eigenvalue problems through the following steps: 1. **Construct basis functions using neural networks**: Use the basis functions generated by neural networks to span a subspace. 2. **Train neural network parameters**: Through appropriate training, optimize the parameters of neural network basis functions. 3. **Dimensionality reduction techniques**: Apply dimensionality reduction techniques to reduce the dimension of the subspace and further improve the accuracy of the approximate solution. 4. **Galerkin projection**: Project the original eigenvalue problem onto the subspace and solve for the approximate solution. ### Method Advantages Through this method, the author can obtain approximate eigenvalues with an accuracy of \(10^{-11}\) within a relatively small number of training epochs (less than 400 epochs), which is significantly better than other existing neural - network - based methods. ### Mathematical Representation To understand this method more clearly, the following are the Markdown representations of several key formulas: - **Weak form of eigenvalue problem**: \[ a(u, v)=\lambda b(u, v), \quad \forall v \in H^1_0(\Omega) \] where, \[ a(u, v)=(\alpha \nabla u, \nabla v)+(\beta u, v), \quad b(u, v)=(u, v) \] - **Loss function**: \[ L(\theta, w)=\text{Trace}(B^{-1}(v_1(x; \theta, w_1), \dots, v_k(x; \theta, w_k)) A(v_1(x; \theta, w_1), \dots, v_k(x; \theta, w_k))) \] where, \[ v_i(x; \theta, w_i)=\sum_{j = 1}^{M} w_i^{(j)} \phi_j(x, \theta) \] - **Termination criterion**: \[ \left|\frac{\bar{l}_s}{l_s}\right|\leq\epsilon \quad \text{or} \quad N_{\text{epoch}}\geq N_{\text{max}} \] where, \[ l_s = L_s(\theta, w), \quad \bar{l}_s=\frac{1}{10}\sum_{i = 1}^{10}|l_{s - i}-l_{s + 1 - i}| \] Through these formulas and steps, the author shows how to effectively solve eigenvalue problems using neural networks and subspace methods, especially in the low - dimensional case, achieving a significant improvement in accuracy.