Abstract:SIAM Journal on Mathematical Analysis, Volume 56, Issue 1, Page 1114-1148, February 2024. In the first part of this paper we develop a theory for image restoration with a learned regularizer that is analogous to that of Meyer's geometric characterization of solutions of the classical variational method of Rudin–Osher–Fatemi (ROF). The learned regularizer we use is a Kantorovich potential for an optimal transport problem of mapping a distribution of noisy images onto clean ones, as first proposed by Lunz, Öktem, and Schönlieb. We show that the effect of their restoration method on the distribution of the images is an explicit Euler discretization of a gradient flow on probability space, while our variational problem, dubbed Wasserstein ROF (WROF), is the corresponding implicit Euler discretization. We obtain our geometric characterization of the solution in this learned regularizer setting by first proving a much more general convex analysis theorem for variational problems having solutions characterized by projections. We then use optimal transport arguments to obtain the corresponding theorem for WROF from this general result, as well as a natural decomposition of a transport map into large scale "features" and small scale "details," where scale refers to the magnitude of the transport distance. In the second part of the paper we leverage our theory for restoration with learned regularizers to analyze two algorithms which iterate WROF. We refer to these as iterative regularization and multiscale transport. For the former we obtain a proof of convergence to the clean data. For the latter we produce successive approximations to the target distribution that match it up to finer and finer scales. These two algorithms are in complete analogy to well-known effective methods based on ROF for iterative denoising, respectively hierarchical image decomposition. We also obtain an analogue of the Tadmor–Nezzar–Vese energy identity, which decomposes the Wasserstein 2 distance between two measures into a sum of nonnegative terms that correspond to transport costs at different scales.

A New Perspective On Denoising Based On Optimal Transport

Optimal Transport for Unsupervised Denoising Learning

Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

Posterior Sampling with Denoising Oracles via Tilted Transport

An optimal transport analogue of the Rudin Osher Fatemi model and its corresponding multiscale theory

On the Posterior Distribution in Denoising: Application to Uncertainty Quantification

An Optimal Transport Analogue of the Rudin–Osher–Fatemi Model and Its Corresponding Multiscale Theory

Neural Estimation Of Entropic Optimal Transport

Unifying Distributionally Robust Optimization via Optimal Transport Theory

Monge, Bregman and Occam: Interpretable Optimal Transport in High-Dimensions with Feature-Sparse Maps

Partial Relaxed Optimal Transport for Denoised Recommendation

Interpreting and Improving Diffusion Models from an Optimization Perspective

Denoising of structured random processes

Semi-Discrete Optimal Transport: Nearly Minimax Estimation With Stochastic Gradient Descent and Adaptive Entropic Regularization

From Denoising Diffusions to Denoising Markov Models

Sparsity-Aware Optimal Transport for Unsupervised Restoration Learning

Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance

Low-rank Optimal Transport: Approximation, Statistics and Debiasing

Non-asymptotic bounds for forward processes in denoising diffusions: Ornstein-Uhlenbeck is hard to beat

Observation-Guided Diffusion Probabilistic Models

A Variational Perspective on Solving Inverse Problems with Diffusion Models