Statistical algorithms for low-frequency diffusion data: A PDE approach

Matteo Giordano, Sven Wang
2024-05-02
Abstract: We consider the problem of making nonparametric inference in multi-dimensional diffusion models from low-frequency data. Statistical analysis in this setting is notoriously challenging due to the intractability of the likelihood and its gradient, and computational methods have thus far largely resorted to expensive simulation-based techniques. In this article, we propose a new computational approach which is motivated by PDE theory and is built around the characterisation of the transition densities as solutions of the associated heat (Fokker-Planck) equation. Employing optimal regularity results from the theory of parabolic PDEs, we prove a novel characterisation for the gradient of the likelihood. Using these developments, for the nonlinear inverse problem of recovering the diffusivity (in divergence form models), we then show that the numerical evaluation of the likelihood and its gradient can be reduced to standard elliptic eigenvalue problems, solvable by powerful finite element methods. This enables the efficient implementation of a large class of statistical algorithms, including (i) preconditioned Crank-Nicolson and Langevin-type methods for posterior sampling, and (ii) gradient-based descent optimisation schemes to compute maximum likelihood and maximum-a-posteriori estimates. We showcase the effectiveness of these methods via extensive simulation studies in a nonparametric Bayesian model with Gaussian process priors. Interestingly, the optimisation schemes provided satisfactory numerical recovery while exhibiting rapid convergence towards stationary points despite the problem nonlinearity; thus our approach may lead to significant computational speed-ups. The reproducible code is available online at
Methodology, Numerical Analysis, Statistics Theory, Computation
What problem does this paper attempt to address?
This paper addresses nonparametric inference in multi-dimensional diffusion models from low-frequency data, a setting that is statistically challenging because the likelihood and its gradient are intractable. The paper proposes a new computational approach based on the theory of partial differential equations (PDEs): it characterizes the transition densities of the diffusion process as solutions of the associated heat (Fokker-Planck) equation and, for the problem of recovering the diffusivity in divergence form models, reduces the numerical evaluation of the likelihood and its gradient to standard elliptic eigenvalue problems solvable by finite element methods. This enables the implementation of a range of statistical algorithms, including preconditioned Crank-Nicolson and Langevin-type methods for posterior sampling and gradient-based optimization schemes for computing maximum likelihood and maximum-a-posteriori estimates. Simulation studies in a nonparametric Bayesian model with Gaussian process priors demonstrate the effectiveness of these methods; the optimization schemes converge rapidly towards stationary points despite the nonlinearity of the problem, which may lead to significant computational speed-ups.
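To make the reduction to an elliptic eigenvalue problem concrete, here is a minimal Python sketch. It is not the authors' code, and several choices are assumptions made only for illustration: a one-dimensional domain [0, 1], reflecting (Neumann) boundaries, a generator of the form u -> (f u')', a simple finite-volume discretization in place of the finite element methods mentioned in the abstract, and a nearest-cell lookup for the observations. All function names are hypothetical. The sketch computes the eigenpairs of the discretized operator once and then assembles the transition density at lag t as a spectral sum, from which the low-frequency log-likelihood follows.

```python
import numpy as np


def neumann_generator(f, n):
    """Finite-volume matrix for u -> (f u')' on [0, 1] with zero-flux boundaries.

    f is a positive function (the diffusivity), n the number of grid cells.
    This discretization is an illustrative assumption, not the paper's scheme.
    """
    h = 1.0 / n
    faces = np.linspace(0.0, 1.0, n + 1)[1:-1]  # the n - 1 interior cell faces
    w = f(faces) / h**2                          # diffusivity at faces over h^2
    A = np.zeros((n, n))
    idx = np.arange(n - 1)
    A[idx, idx + 1] = w                          # coupling of cell i with cell i + 1
    A[idx + 1, idx] = w
    A[idx, idx] -= w                             # rows sum to zero (mass conservation)
    A[idx + 1, idx + 1] -= w
    return A, h


def transition_density(f, n, t):
    """Approximate p_t(x_i, x_j) on cell centres via the eigenpairs of -A."""
    A, h = neumann_generator(f, n)
    lam, V = np.linalg.eigh(-A)                  # symmetric eigenvalue problem
    P = (V * np.exp(-t * lam)) @ V.T             # exp(tA) = sum_k e^{-t lam_k} v_k v_k^T
    return P / h, h                              # divide by h: density w.r.t. Lebesgue


def log_likelihood(obs, f, t, n=200):
    """Log-likelihood of observations X_0, X_t, X_2t, ... (initial state excluded)."""
    p, h = transition_density(f, n, t)
    cells = np.clip((np.asarray(obs) / h).astype(int), 0, n - 1)  # nearest-cell lookup
    return float(np.sum(np.log(p[cells[:-1], cells[1:]])))


if __name__ == "__main__":
    # Hypothetical diffusivity and made-up observations, for illustration only.
    diffusivity = lambda x: 1.0 + 0.5 * np.sin(2.0 * np.pi * x)
    observations = [0.10, 0.40, 0.35, 0.80]
    print(log_likelihood(observations, diffusivity, t=0.5))
```

Because the eigendecomposition does not depend on the data, it is computed once per candidate diffusivity and reused for every observed transition. That reuse is what makes repeated likelihood evaluations inside a sampler or optimizer comparatively cheap in a sketch like this one.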