Stein Variational Evolution Strategies

Cornelius V. Braun, Robert T. Lange, Marc Toussaint
2024-10-14
Abstract: Stein Variational Gradient Descent (SVGD) is a highly efficient method to sample from an unnormalized probability distribution. However, the SVGD update relies on gradients of the log-density, which may not always be available. Existing gradient-free versions of SVGD make use of simple Monte Carlo approximations or gradients from surrogate distributions, both with limitations. To improve gradient-free Stein variational inference, we combine SVGD steps with evolution strategy (ES) updates. Our results demonstrate that the resulting algorithm generates high-quality samples from unnormalized target densities without requiring gradient information. Compared to prior gradient-free SVGD methods, we find that the integration of the ES update in SVGD significantly improves the performance on multiple challenging benchmark problems.
Machine Learning, Artificial Intelligence, Neural and Evolutionary Computing
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve

The paper addresses the problem of efficiently sampling from unnormalized probability distributions when gradient information is unavailable. Stein Variational Gradient Descent (SVGD) is an efficient sampling method, but its update step relies on the gradient of the log-density (the score), which is not available in many practical problems. Existing gradient-free versions of SVGD use either simple Monte Carlo approximations or gradients from a surrogate distribution, and both approaches have limitations.

To improve gradient-free Stein variational inference, the authors combine SVGD steps with evolution strategy (ES) updates, an approach they call Stein Variational Evolution Strategies (SV-ES). In particular, they introduce Stein Variational CMA-ES (SV-CMA-ES), which makes SVGD gradient-free by using the CMA-ES search-distribution update in place of the score term in the SVGD particle update. Experimental results show that SV-CMA-ES generates high-quality samples without gradient information and significantly outperforms existing gradient-free SVGD methods on several challenging benchmark problems.
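To make the idea of swapping the score for an ES-based estimate concrete, here is a minimal one-dimensional sketch. It is not the authors' SV-CMA-ES: instead of a CMA-ES update it uses a plain antithetic Gaussian-smoothing ES estimator, and the toy target, the fixed RBF bandwidth, and all function names (`log_density`, `es_score`, `svgd_step`, `run`) are assumptions made purely for illustration.

```python
import math
import random


def log_density(x):
    # Toy target (assumption): unnormalized log-density of a 1-D Gaussian
    # with mean 2 and unit variance. Only evaluations are used, no gradient.
    return -0.5 * (x - 2.0) ** 2


def es_score(x, rng, sigma=0.1, n_pairs=8):
    # Antithetic ES estimate of d/dx log p(x) from function values only.
    # This stands in for the analytic score used by standard SVGD.
    total = 0.0
    for _ in range(n_pairs):
        eps = rng.gauss(0.0, 1.0)
        diff = log_density(x + sigma * eps) - log_density(x - sigma * eps)
        total += eps * diff
    return total / (2.0 * sigma * n_pairs)


def svgd_step(particles, rng, step_size=0.1, bandwidth=0.5):
    # One SVGD-style update with an RBF kernel of fixed bandwidth.
    # Each particle moves along the kernel-weighted average of the
    # (estimated) scores plus a repulsive term from the kernel gradient.
    scores = [es_score(x, rng) for x in particles]
    n = len(particles)
    updated = []
    for xi in particles:
        phi = 0.0
        for xj, sj in zip(particles, scores):
            k = math.exp(-((xj - xi) ** 2) / (2.0 * bandwidth ** 2))
            attraction = k * sj
            repulsion = k * (xi - xj) / bandwidth ** 2
            phi += attraction + repulsion
        updated.append(xi + step_size * phi / n)
    return updated


def run(n_particles=20, n_steps=500, seed=0):
    # Start all particles far from the target mode and iterate the update.
    rng = random.Random(seed)
    particles = [rng.uniform(-4.0, -2.0) for _ in range(n_particles)]
    for _ in range(n_steps):
        particles = svgd_step(particles, rng)
    return particles


if __name__ == "__main__":
    final = run()
    print("particle mean:", sum(final) / len(final))
    print("particle range:", min(final), max(final))
```

In this sketch the attraction term drives particles toward high-density regions using only evaluations of `log_density`, while the repulsion term keeps them from collapsing onto a single point. The method described above relies on the CMA-ES search-distribution update rather than this simple estimator.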