Approximation Theory, Computing, and Deep Learning on the Wasserstein Space

Massimo Fornasier, Pascal Heid, Giacomo Enrico Sodini
2024-10-10
Abstract: The challenge of approximating functions in infinite-dimensional spaces from finite samples is widely regarded as formidable. We delve into the challenging problem of the numerical approximation of Sobolev-smooth functions defined on probability spaces. Our particular focus centers on the Wasserstein distance function, which serves as a relevant example. In contrast to the existing body of literature focused on approximating efficiently pointwise evaluations, we chart a new course to define functional approximants by adopting three machine learning-based approaches: 1. Solving a finite number of optimal transport problems and computing the corresponding Wasserstein potentials. 2. Employing empirical risk minimization with Tikhonov regularization in Wasserstein Sobolev spaces. 3. Addressing the problem through the saddle point formulation that characterizes the weak form of the Tikhonov functional's Euler-Lagrange equation. We furnish explicit and quantitative bounds on generalization errors for each of these solutions. We leverage the theory of metric Sobolev spaces and we combine it with techniques of optimal transport, variational calculus, and large deviation bounds. In our numerical implementation, we harness appropriately designed neural networks to serve as basis functions. These networks undergo training using diverse methodologies. This approach allows us to obtain approximating functions that can be rapidly evaluated after training. Our constructive solutions significantly enhance at equal accuracy the evaluation speed, surpassing that of state-of-the-art methods by several orders of magnitude. This allows evaluations over large datasets several times faster, including training, than traditional optimal transport algorithms. Our analytically designed deep learning architecture slightly outperforms the test error of state-of-the-art CNN architectures on datasets of images.
Optimization and Control, Machine Learning, Functional Analysis
What problem does this paper attempt to address?
The main problem this paper addresses is how to efficiently approximate Sobolev-smooth functions defined on the space of probability measures, in particular the Wasserstein distance function, from limited sample information. Approximating functions from finitely many samples is already very challenging in high-dimensional Euclidean spaces and becomes harder still in infinite-dimensional metric spaces. Specifically, the authors study how to approximate such functions from a finite number of point evaluations. The Wasserstein distance function is their main example because it is defined as the optimal value of an optimal transport problem.

The paper proposes three machine learning-based methods to define functional approximants:

1. **Solving a finite number of optimal transport problems and computing the corresponding Wasserstein potentials**: This method solves finitely many optimal transport problems and uses the resulting Wasserstein potentials to approximate the Wasserstein distance.
2. **Using empirical risk minimization with Tikhonov regularization in the Wasserstein Sobolev space**: This method builds the approximant by minimizing an empirical risk over a Wasserstein Sobolev space, with a Tikhonov regularization term added.
3. **Solving the Euler-Lagrange equation of the Tikhonov functional via a saddle point formulation**: This method builds the approximant from the saddle point problem that characterizes the weak form of the Euler-Lagrange equation of the Tikhonov functional.

The authors also provide explicit and quantitative generalization error bounds for each method, and in the numerical implementation they use appropriately designed neural networks as basis functions. Once trained, these networks can be evaluated quickly: at equal accuracy, the reported evaluation speed exceeds that of state-of-the-art methods by several orders of magnitude. This makes pairwise Wasserstein distance evaluation on large-scale datasets much faster than with traditional optimal transport algorithms.

In summary, this paper aims to approximate Sobolev-smooth functions on the space of probability measures efficiently by introducing new machine learning methods, with a particular gain in computational efficiency on large-scale data.
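The idea behind the first method can be illustrated in a much simpler setting than the one treated in the paper. The sketch below assumes measures supported on a fixed, sorted one-dimensional grid and uses the 1-Wasserstein distance, chosen only because its Kantorovich potentials have a closed form there. The function names `w1`, `potential`, `fit`, and `approx_w1` are hypothetical, and this is not claimed to be the authors' construction: it only shows how potentials computed for a few training measures can be reused to evaluate an approximant on new measures without solving a new transport problem.

```python
from itertools import accumulate


def cdf(weights):
    """Cumulative sums of a probability vector on the grid."""
    return list(accumulate(weights))


def w1(mu, nu, grid):
    """Exact W_1 between two measures on a sorted 1D grid (integral of |F - G|)."""
    F, G = cdf(mu), cdf(nu)
    return sum(abs(F[k] - G[k]) * (grid[k + 1] - grid[k])
               for k in range(len(grid) - 1))


def potential(mu, nu, grid):
    """A 1-Lipschitz Kantorovich potential f for the pair (mu, nu), on the grid.

    f has slope +1 where the CDF of nu exceeds the CDF of mu and slope -1
    elsewhere, so that sum_k f[k] * (mu[k] - nu[k]) equals W_1(mu, nu).
    """
    F, G = cdf(mu), cdf(nu)
    f = [0.0]
    for k in range(len(grid) - 1):
        slope = 1.0 if G[k] > F[k] else -1.0
        f.append(f[-1] + slope * (grid[k + 1] - grid[k]))
    return f


def fit(train_measures, nu, grid):
    """'Training': solve one transport problem per sample and keep its potential."""
    return [potential(mu, nu, grid) for mu in train_measures]


def approx_w1(mu, nu, potentials):
    """Approximant: best dual value among the stored potentials."""
    return max(
        sum(f_k * (m_k - n_k) for f_k, m_k, n_k in zip(f, mu, nu))
        for f in potentials
    )


# Illustrative data (assumed, not taken from the paper).
grid = [0.0, 1.0, 2.0, 3.0]
nu = [0.25, 0.25, 0.25, 0.25]                    # fixed reference measure
train = [[1.0, 0.0, 0.0, 0.0],
         [0.0, 0.0, 0.0, 1.0],
         [0.5, 0.0, 0.0, 0.5]]
potentials = fit(train, nu, grid)

new_mu = [0.0, 1.0, 0.0, 0.0]                    # measure not seen in training
print("approximant:", approx_w1(new_mu, nu, potentials))
print("exact W_1:  ", w1(new_mu, nu, grid))
```

Because every stored potential is 1-Lipschitz, Kantorovich-Rubinstein duality guarantees that this approximant never exceeds the true distance and is exact on the training measures. Evaluating it on a new measure costs one weighted sum per stored potential, which is the kind of speed-up after training that the summary describes.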