Abstract:This paper presents a method to leverage arbitrary neural network architecture for control variates. Control variates are crucial in reducing the variance of Monte Carlo integration, but they hinge on finding a function that both correlates with the integrand and has a known analytical integral. Traditional approaches rely on heuristics to choose this function, which might not be expressive enough to correlate well with the integrand. Recent research alleviates this issue by modeling the integrands with a learnable parametric model, such as a neural network. However, the challenge remains in creating an expressive parametric model with a known analytical integral. This paper proposes a novel approach to construct learnable parametric control variates functions from arbitrary neural network architectures. Instead of using a network to approximate the integrand directly, we employ the network to approximate the anti-derivative of the integrand. This allows us to use automatic differentiation to create a function whose integration can be constructed by the antiderivative network. We apply our method to solve partial differential equations using the Walk-on-sphere algorithm. Our results indicate that this approach is unbiased and uses various network architectures to achieve lower variance than other control variate methods.

What problem does this paper attempt to address?

The core problem that this paper attempts to solve is: **How to use any neural network architecture to construct control variates (CV) to reduce the variance of Monte Carlo integration**. Specifically, the paper proposes a novel method to construct control variates with known analytical integrals by having the neural network approximate the antiderivative of the integrand instead of directly approximating the integrand itself. ### Background problem Monte Carlo integration is an important tool for estimating integral values through random sampling and is widely used in fields such as computer graphics and physical simulation. However, one of the main drawbacks of Monte Carlo integration is that its results usually have a high variance, requiring a large number of samples to obtain an accurate estimate. To reduce the variance, one of the commonly used techniques is the control variate method. ### Limitations of traditional methods Traditional control variate methods rely on finding a function related to the integrand and having a known analytical integral. These methods are usually based on heuristic selection and may not match complex integrands well, thus limiting their performance. Recent research has attempted to use learnable parametric models (such as neural networks) to approximate the integrand, but still faces the challenge of how to construct an expressive parametric model with a known analytical integral. ### The solution proposed in the paper This paper proposes a new method by having the neural network approximate the antiderivative of the integrand instead of directly approximating the integrand itself. The specific steps are as follows: 1. **Define the antiderivative network**: Suppose we have a neural network \( G_\theta: \mathbb{R} \to \mathbb{R} \) that approximates the antiderivative of the integrand \( f(x) \), that is: \[ \frac{\partial}{\partial x} G_\theta(x) = g(x) \] where \( g(x) \) is the control variate function. 2. **Apply automatic differentiation**: Calculate the derivative \( \frac{\partial}{\partial x} G_\theta(x) \) of \( G_\theta(x) \) through an automatic differentiation framework and use it as a control variate. 3. **Construct the integration formula**: According to the fundamental theorem of calculus, we can obtain: \[ \int_l^u f(x) \, dx = G_\theta(u) - G_\theta(l) + \int_l^u \left( f(x) - \frac{\partial}{\partial x} G_\theta(x) \right) \, dx \] In this way, we can optimize \( G_\theta \) to minimize the variance of \( f(x) - \frac{\partial}{\partial x} G_\theta(x) \), thereby reducing the variance of the overall integral. ### Application example The author applies this method to the Walk - on - Sphere algorithm for solving partial differential equations (PDEs) and shows that this method can provide unbiased estimates under different network architectures and has a lower variance compared to other control variate methods. ### Summary The main contributions of the paper include: - Proposing a new method of using any neural network architecture as a control variate. - Proposing a numerically stable way to construct control variate estimators for different integration domains. - Demonstrating the effectiveness of this method in the application of solving Laplace and Poisson equations, with performance superior to existing baseline methods. This method not only expands the range of control variate selection but also improves the efficiency and accuracy of Monte Carlo integration.

Neural Control Variates with Automatic Integration

Neural Control Variates for Variance Reduction

Neural Control Variates

The CV Makes the Difference – Control Variates for Neural Networks

Using Large Ensembles of Control Variates for Variational Inference

Theoretical guarantees for neural control variates in MCMC

Control Variate Approximation for DNN Accelerators

Neural Networks are Integrable

Pathwise Gradient Variance Reduction with Control Variates in Variational Inference

Neural Differentiable Integral Control Barrier Functions for Unknown Nonlinear Systems with Input Constraints

Iterative Learning Control Of Varying Trajectories For Robot Manipulators

Neural variance reduction for stochastic differential equations

Function Approximation with Randomly Initialized Neural Networks for Approximate Model Reference Adaptive Control

A Neural Network Approach for Stochastic Optimal Control

Joint control variate for faster black-box variational inference

Neural Control of Parametric Solutions for High-dimensional Evolution PDEs

Neural Control Systems

A Variational Inequality Model for Learning Neural Networks

Neural Network Approaches for Parameterized Optimal Control