Error Analysis of Three-Layer Neural Network Trained with PGD for Deep Ritz Method

Yuling Jiao,Yanming Lai,Yang Wang

2024-05-19

Abstract:Machine learning is a rapidly advancing field with diverse applications across various domains. One prominent area of research is the utilization of deep learning techniques for solving partial differential equations(PDEs). In this work, we specifically focus on employing a three-layer tanh neural network within the framework of the deep Ritz method(DRM) to solve second-order elliptic equations with three different types of boundary conditions. We perform projected gradient descent(PDG) to train the three-layer network and we establish its global convergence. To the best of our knowledge, we are the first to provide a comprehensive error analysis of using overparameterized networks to solve PDE problems, as our analysis simultaneously includes estimates for approximation error, generalization error, and optimization error. We present error bound in terms of the sample size $n$ and our work provides guidance on how to set the network depth, width, step size, and number of iterations for the projected gradient descent algorithm. Importantly, our assumptions in this work are classical and we do not require any additional assumptions on the solution of the equation. This ensures the broad applicability and generality of our results.

Numerical Analysis,Artificial Intelligence,Analysis of PDEs,Machine Learning

What problem does this paper attempt to address?

### What problem does this paper attempt to solve? This paper primarily explores the problem of solving partial differential equations (PDEs) using deep learning methods. Specifically, it addresses the following: 1. **Solving second-order elliptic equations using a three-layer neural network**: The paper focuses on solving second-order elliptic equations with three different boundary conditions using a three-layer tanh neural network within the framework of the Deep Ritz method. 2. **Error analysis**: It provides a comprehensive error analysis, including approximation error, generalization error, and optimization error. This is the first time that all three types of errors are considered simultaneously in solving PDE problems. 3. **Theoretical guidance**: It offers specific guidance on how to set the depth, width, step size, and number of iterations of the neural network to ensure the effectiveness of the projected gradient descent algorithm. 4. **Generality and applicability**: The assumptions made are classical and common, without requiring additional assumptions about the solution of the equation, thus ensuring the broad applicability and generality of the results. Through these studies, the paper aims to provide a solid theoretical foundation for solving PDE problems using machine learning methods.

Error Analysis of Three-Layer Neural Network Trained with PGD for Deep Ritz Method

Error Analysis of Deep Ritz Methods for Elliptic Equations.

Convergence Rate Analysis for Deep Ritz Method

Deep Ritz Methods for Laplace Equations with Dirichlet Boundary Condition

A Natural Primal-Dual Hybrid Gradient Method for Adversarial Neural Network Training on Solving Partial Differential Equations

Full error analysis for the training of deep neural networks

A Priori Error Estimate of Deep Mixed Residual Method for Elliptic PDEs

A Non-Gradient Method for Solving Elliptic Partial Differential Equations with Deep Neural Networks.

Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax Optimality

Convergence Analysis of a Quasi-Monte Carlo-based Deep Learning Algorithm for Solving Partial Differential Equations

Deep Petrov-Galerkin Method for Solving Partial Differential Equations

Global Convergence of Deep Galerkin and PINNs Methods for Solving Partial Differential Equations

A comparison study of deep Galerkin method and deep Ritz method for elliptic problems with different boundary conditions

Error Analysis of the Deep Mixed Residual Method for High-order Elliptic Equations

Error analysis for empirical risk minimization over clipped ReLU networks in solving linear Kolmogorov partial differential equations

Error Analysis of Physics-Informed Neural Networks for Approximating Dynamic PDEs of Second Order in Time

Adaptive multilayer neural network for solving elliptic partial differential equations with different boundary conditions

Implicit Bias in Understanding Deep Learning for Solving PDEs Beyond Ritz-Galerkin Method

Refined generalization analysis of the Deep Ritz Method and Physics-Informed Neural Networks

Statistical Numerical PDE : Fast Rate, Neural Scaling Law and When it’s Optimal