Interacting Langevin Diffusions: Gradient Structure And Ensemble Kalman Sampler

Alfredo Garbuno-Inigo, Franca Hoffmann, Wuchen Li, Andrew M. Stuart
DOI: https://doi.org/10.48550/arXiv.1903.08866
2019-10-17
Abstract: Solving inverse problems without the use of derivatives or adjoints of the forward model is highly desirable in many applications arising in science and engineering. In this paper, we propose a new version of such a methodology, a framework for its analysis, and numerical evidence of the practicality of the method proposed. Our starting point is an ensemble of over-damped Langevin diffusions which interact through a single preconditioner computed as the empirical ensemble covariance. We demonstrate that the nonlinear Fokker-Planck equation arising from the mean-field limit of the associated stochastic differential equation (SDE) has a novel gradient flow structure, built on the Wasserstein metric and the covariance matrix of the noisy flow. Using this structure, we investigate large time properties of the Fokker-Planck equation, showing that its invariant measure coincides with that of a single Langevin diffusion, and demonstrating exponential convergence to the invariant measure in a number of settings. We introduce a new noisy variant on ensemble Kalman inversion (EKI) algorithms found from the original SDE by replacing exact gradients with ensemble differences; this defines the ensemble Kalman sampler (EKS). Numerical results are presented which demonstrate its efficacy as a derivative-free approximate sampler for the Bayesian posterior arising from inverse problems.
Dynamical Systems
What problem does this paper attempt to address?
This paper addresses how to solve inverse problems without using derivatives or adjoints of the forward model, a need that arises in many science and engineering applications. It proposes a new method, the Ensemble Kalman Sampler (EKS), together with a framework for analyzing it and numerical evidence of its practicality. Specifically, the paper focuses on sampling from the Bayesian posterior distribution using an ensemble of over-damped Langevin diffusions that interact through a preconditioner given by the empirical ensemble covariance. Ensemble Kalman Inversion (EKI) and its variants play an important role in large-scale science and engineering problems because they avoid computing derivatives and adjoints of the forward map \(G\), which is a major difficulty in many practical applications.

The main contributions of the paper are:

1. **Introduction of new noise perturbations**: A new noise-perturbed version of the continuous-time Ensemble Kalman Inversion algorithm is proposed. It forms an interacting particle system written as a stochastic differential equation (SDE), namely the Ensemble Kalman Sampler (EKS).
2. **Research on related SDEs**: A related SDE is studied in which the ensemble differences are approximated by gradients; for linear inverse problems, this approximation is exact. The mean-field limit of this related SDE is studied, and a new Kalman-Wasserstein gradient-flow structure is identified in the associated nonlinear Fokker-Planck equation.
3. **Steady-state characteristics**: Using the Kalman-Wasserstein structure, the steady states of the nonlinear Fokker-Planck equation are characterized, and it is shown that one of them is the posterior density \(\pi(u)\).
4. **Global attractor**: By explicitly solving the nonlinear Fokker-Planck equation in the case of linear \(G\), it is proved that the posterior density is the global attractor for all initial densities that have finite energy and are not Dirac measures.
5. **Numerical experiments**: Numerical examples indicate that the EKS algorithm provides good approximate samples of the posterior distribution for simple low-dimensional test problems and for a PDE inverse problem in Darcy flow.

Through these contributions, the paper provides a derivative-free method for generating approximate samples from the Bayesian posterior distribution.
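To make the derivative-free idea in contribution 1 concrete, the following is a minimal sketch of an EKS-style particle update. It is not the paper's implementation. It assumes a zero-mean Gaussian prior with covariance `Gamma0`, Gaussian observational noise with covariance `Gamma`, and a plain explicit Euler-Maruyama step with a fixed step size, so the paper's own time-stepping may differ. The function name `eks_step`, the toy linear forward map `A`, the data `y`, and all numerical values are illustrative assumptions. The forward map is only evaluated, never differentiated: the gradient term is replaced by the ensemble cross-covariance between parameters and outputs, and the noise is scaled by a square-root factor of the ensemble covariance built from the ensemble deviations.

```python
import numpy as np

def eks_step(U, G, y, Gamma_inv, Gamma0_inv, dt, rng):
    """One explicit Euler-Maruyama step of an EKS-style update (illustrative).

    U is the (J, d) ensemble; G maps a parameter vector to model output.
    """
    J = U.shape[0]
    GU = np.array([G(u) for u in U])      # (J, m) forward evaluations only
    dU = U - U.mean(axis=0)               # parameter deviations from the mean
    dG = GU - GU.mean(axis=0)             # output deviations from the mean
    C_uG = dU.T @ dG / J                  # (d, m) parameter-output covariance
    C_uu = dU.T @ dU / J                  # (d, d) ensemble covariance
    # Ensemble differences stand in for the gradient of the data misfit;
    # the second term is the covariance-preconditioned Gaussian prior drift.
    drift = -(GU - y) @ Gamma_inv @ C_uG.T - U @ Gamma0_inv @ C_uu
    # Noise with covariance 2*dt*C_uu, using dU/sqrt(J) as a square-root factor.
    xi = rng.standard_normal((J, J))
    noise = np.sqrt(2.0 * dt / J) * xi @ dU
    return U + dt * drift + noise

# Toy linear-Gaussian problem (all values are assumptions for illustration).
rng = np.random.default_rng(0)
A = np.array([[1.0, 0.5], [0.0, 1.0]])   # hypothetical linear forward map
y = np.array([1.0, -0.5])                # hypothetical data
Gamma_inv = np.eye(2) / 0.25             # inverse noise covariance
Gamma0_inv = np.eye(2)                   # inverse prior covariance
G = lambda u: A @ u

U = rng.standard_normal((200, 2))        # initial ensemble drawn from the prior
for _ in range(1000):
    U = eks_step(U, G, y, Gamma_inv, Gamma0_inv, dt=0.01, rng=rng)

# Closed-form Gaussian posterior for the linear case, used as a reference.
post_cov = np.linalg.inv(A.T @ Gamma_inv @ A + Gamma0_inv)
post_mean = post_cov @ A.T @ Gamma_inv @ y
```

For this linear problem, comparing `U.mean(axis=0)` and `np.cov(U.T, bias=True)` with `post_mean` and `post_cov` gives a rough check that the ensemble has settled near the posterior. Agreement is only approximate, since the ensemble is finite and the time step is not infinitesimal.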