Efficient Approximation of Solutions of Parametric Linear Transport Equations by ReLU DNNs

Fabian Laakmann,Philipp Petersen
DOI: https://doi.org/10.48550/arXiv.2001.11441
2020-01-31
Abstract:We demonstrate that deep neural networks with the ReLU activation function can efficiently approximate the solutions of various types of parametric linear transport equations. For non-smooth initial conditions, the solutions of these PDEs are high-dimensional and non-smooth. Therefore, approximation of these functions suffers from a curse of dimension. We demonstrate that through their inherent compositionality deep neural networks can resolve the characteristic flow underlying the transport equations and thereby allow approximation rates independent of the parameter dimension.
Numerical Analysis,Analysis of PDEs,Functional Analysis
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to efficiently approximate the solutions of parametric linear transport equations using deep neural networks (DNNs). Specifically, the article explores the use of deep neural networks with ReLU activation functions to approximate the solutions of these equations and shows that this method can overcome the curse of dimensionality in high - dimensional problems. The following is a detailed interpretation of the paper's problem: ### Research Background and Problem Description 1. **Importance of Linear Parametric Transport Equations** - Linear parametric transport equations play an important role in engineering, modeling, and mathematical physics, and are used to describe physical phenomena such as heat conduction and mass transfer. - A typical example is the transport of pollutants in air or water, which depends on parameters such as the direction and intensity of fluid flow. 2. **Challenges of the Problem** - For non - smooth initial conditions, the solutions of these partial differential equations (PDEs) are high - dimensional and non - smooth. - Direct approximation of these functions is affected by the curse of dimensionality, that is, as the dimension of the parameter space increases, the approximation error deteriorates exponentially. ### Core Contributions of the Paper - **Application of the Method of Characteristics** - Through the method of characteristics, even if the solution is not smooth, its singularity also propagates along smooth curves called characteristic curves. - In this framework, the function \( u \) can be represented as a combination of two functions: one is high - dimensional and smooth, and the other is low - dimensional and (possibly) rough. - **Advantages of Deep Neural Networks** - Due to their inherent combinatorial structure, deep neural networks can analyze the feature flow behind the transport equations, allowing the approximation rate to be independent of the parameter dimension. - In particular, in the given setting, the author observes that the approximation rate is independent of the dimension \( D \) of the parameter space. ### Mathematical Formulas The main formulas involved in the article include: - **Transport Equation** \[ \begin{cases} \partial_t u(t, x, \eta)+V(t, x, \eta)\cdot\nabla_x u(t, x, \eta)=f(t, x, \eta), \\ u(0, x, \eta)=u_0(x). \end{cases} \] - **Characteristic Curves** \[ \begin{cases} \dot{\gamma}(s)=V(s, \gamma(s), \eta), \\ \gamma(t)=x. \end{cases} \] - **Expression of the Solution** - For the standard linear transport equation: \[ u(t, x, \eta)=u_0(X(0, t, x, \eta)). \] - For the inhomogeneous linear transport equation with a source term: \[ u(t, x, \eta)=u_0(X(0, t, x, \eta))+\int_0^t f(s, X(s, t, x, \eta), \eta)\,ds. \] ### Applications and Significance - **Approximation Theory** - The solutions of high - dimensional parametric linear transport equations form a related but non - standard class of functions. Studying the approximation ability of deep neural networks in this structure is of great significance. - **Estimation Problems** - In machine learning, especially deep learning, deep neural networks are trained to minimize the empirical energy based on random samples. For transport problems with unknown parameters, this technique can successfully solve the problem. - **Numerical Analysis** - As a trial space for PDEs, deep neural networks can adaptively capture the real structure of the solution, especially suitable for dealing with singular points distributed along low - dimensional manifolds. ### Summary By introducing deep neural networks, the paper provides a new method for efficiently approximating the solutions of parametric linear transport equations, overcoming the limitations of traditional methods in high - dimensional problems. This method not only improves computational efficiency but also provides a solution for complex physical problems.