Abstract: The past decade has seen increasing interest in applying Deep Learning (DL) to Computational Science and Engineering (CSE). Driven by impressive results in applications such as computer vision, Uncertainty Quantification (UQ), genetics, simulations and image processing, DL is increasingly supplanting classical algorithms, and seems poised to revolutionize scientific computing. However, DL is not yet well understood from the standpoint of numerical analysis. Little is known about the efficiency and reliability of DL from the perspectives of stability, robustness, accuracy, and sample complexity. In particular, approximating solutions to parametric PDEs is an objective of UQ for CSE. Training data for such problems is often scarce and corrupted by errors. Moreover, the target function is a possibly infinite-dimensional smooth function taking values in the PDE solution space, generally an infinite-dimensional Banach space. This paper provides arguments for Deep Neural Network (DNN) approximation of such functions, with both known and unknown parametric dependence, that overcome the curse of dimensionality. We establish practical existence theorems that describe classes of DNNs with dimension-independent architecture size and training procedures based on minimizing the (regularized) $\ell^2$-loss which achieve near-optimal algebraic rates of convergence. These results involve key extensions of compressed sensing for Banach-valued recovery and polynomial emulation with DNNs. When approximating solutions of parametric PDEs, our results account for all sources of error, i.e., sampling, optimization, approximation and physical discretization, and allow for training high-fidelity DNN approximations from coarse-grained sample data. Our theoretical results fall into the category of non-intrusive methods, providing a theoretical alternative to classical methods for high-dimensional approximation.
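
For concreteness, a regularized $\ell^2$-loss of the kind mentioned in the abstract can be sketched in the following generic form. The notation is an assumption introduced here for illustration and is not taken from the paper: $f$ denotes the target function, $y_1,\dots,y_m$ the sample points, $n_i$ the errors corrupting the training data, $\Phi_\theta$ a DNN with trainable parameters $\theta$, $\|\cdot\|_{\mathcal{V}}$ the norm of the Banach space in which $f$ takes values, $\mathcal{J}$ an unspecified regularizer, and $\lambda \ge 0$ a tuning parameter.

$$
\min_{\theta} \; \frac{1}{m} \sum_{i=1}^{m} \bigl\| f(y_i) + n_i - \Phi_\theta(y_i) \bigr\|_{\mathcal{V}}^{2} \; + \; \lambda \, \mathcal{J}(\theta)
$$

Under these assumptions, setting $\lambda = 0$ gives the unregularized $\ell^2$-loss. The specific regularizer and network classes analyzed in the paper may differ from this sketch.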

Low Dimensional Trajectory Hypothesis is True: DNNs Can Be Trained in Tiny Subspaces

Train Deep Neural Networks in 40-D Subspaces

Deep Manifold Transformation for Dimension Reduction

DMT-EV: an Explainable Deep Network for Dimension Reduction

UDRN: Unified Dimensional Reduction Neural Network for Feature Selection and Feature Projection

How many degrees of freedom do we need to train deep networks: a loss landscape perspective

The Training Process of Many Deep Networks Explores the Same Low-Dimensional Manifold

Low dimensional approximation and generalization of multivariate functions on smooth manifolds using deep ReLU neural networks

Does SGD really happen in tiny subspaces?

Dynamic Sparse Graph for Efficient Deep Learning

Near-optimal learning of Banach-valued, high-dimensional functions via deep neural networks

Maestro: Uncovering Low-Rank Structures via Trainable Decomposition

slimTrain -- A Stochastic Approximation Method for Training Separable Deep Neural Networks

Efficient NTK using Dimensionality Reduction

Hypothesis Spaces for Deep Learning

Scaling Down Deep Learning with MNIST-1D

A dynamical systems based framework for dimension reduction

Incremental Learning in Diagonal Linear Networks

Training neural networks on high-dimensional data using random projection

Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation

Bottleneck Structure in Learned Features: Low-Dimension vs Regularity Tradeoff