Abstract: Gradient-based dimension reduction decreases the cost of Bayesian inference and probabilistic modeling by identifying maximally informative (and informed) low-dimensional projections of the data and parameters, allowing high-dimensional problems to be reformulated as cheaper low-dimensional problems. A broad family of such techniques identify these projections and provide error bounds on the resulting posterior approximations, via eigendecompositions of certain diagnostic matrices. Yet these matrices require gradients or even Hessians of the log-likelihood, excluding the purely data-driven setting and many problems of simulation-based inference. We propose a framework, derived from score-matching, to extend gradient-based dimension reduction to problems where gradients are unavailable. Specifically, we formulate an objective function to directly learn the score ratio function needed to compute the diagnostic matrices, propose a tailored parameterization for the score ratio network, and introduce regularization methods that capitalize on the hypothesized low-dimensional structure. We also introduce a novel algorithm to iteratively identify the low-dimensional reduced basis vectors more accurately with limited data based on eigenvalue deflation methods. We show that our approach outperforms standard score-matching for problems with low-dimensional structure, and demonstrate its effectiveness for PDE-constrained Bayesian inverse problems and conditional generative modeling.

What problem does this paper attempt to address?

The main problem this paper addresses is how to perform efficient dimensionality reduction in Bayesian inference and probabilistic modeling when gradients are unavailable. Specifically, the paper proposes a method based on score ratio matching that extends existing gradient-based dimensionality reduction techniques to problems without gradient information. This makes it possible to identify and exploit low-dimensional structure in high-dimensional problems, thereby reducing the computational cost of Bayesian inference and probabilistic modeling.

### Background and Problem Description of the Paper

In Bayesian computation, a central goal is to develop effective algorithms to characterize conditional distributions, such as the posterior distribution $\pi_{X|Y = y^*}$ of a parameter $X$ given an observation $y^*$. As the parameter dimension $n$ and the observation dimension $m$ increase, the cost of such computations can become very high, making inference in high-dimensional problems difficult. Dimensionality reduction is therefore an important way to alleviate this computational burden.

### Limitations of Existing Methods

Existing gradient-based dimensionality reduction methods identify and exploit low-dimensional structure by constructing specific diagnostic matrices. These matrices contain gradient or Hessian information of the posterior or joint log-density. Because they require that information, these methods exclude purely data-driven settings and many simulation-based inference problems.

### Contributions of the Paper

1. **Proposing a Framework**: The paper proposes a framework based on score ratio matching to extend gradient-based dimensionality reduction methods to problems where gradients are unavailable. Specifically, it introduces a learning problem that approximates the gradient of the log-ratio of two densities, that is, the score ratio function.
2. **New Training Objectives and Network Parameterization**: New training objectives, a network parameterization, and regularization methods are proposed, all tailored to the dimensionality reduction objective.
3. **Iterative Algorithm**: A new algorithm based on eigenvalue deflation iteratively identifies the required low-dimensional subspace basis vectors more accurately, especially when data are limited.
4. **Experimental Verification**: Numerical experiments show that the proposed method outperforms standard score matching on problems with low-dimensional structure and performs well on PDE-constrained Bayesian inverse problems and conditional generative modeling.

### Specific Problem Solving

- **Score Ratio Matching**: Learning the score ratio function makes it possible to construct the diagnostic matrices and thereby identify low-dimensional projections of the parameters and observations.
- **Utilization of Low-Rank Structures**: Under the assumption that the target distribution has low-dimensional structure, the tailored network parameterization and regularization methods capture that structure more effectively.
- **Iterative Improvement**: Eigenvalue deflation gradually improves the estimate of the low-dimensional subspace basis vectors, increasing the accuracy of the dimensionality reduction.

### Conclusion

The method proposed in this paper extends the range of application of existing gradient-based dimensionality reduction techniques and improves the efficiency and accuracy of dimensionality reduction when gradients are unavailable, providing new tools for high-dimensional Bayesian inference and probabilistic modeling.
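
The diagnostic-matrix step described in this summary can be illustrated with a minimal NumPy sketch. It is not the paper's algorithm. It assumes, purely for illustration, a linear-Gaussian toy model whose score ratio (here taken to be the gradient of the log-likelihood with respect to the parameter) is known in closed form, whereas the paper would use a learned score ratio network at that point. The function names `diagnostic_matrix`, `reduced_basis`, and `deflate` are hypothetical, and the deflation shown is textbook Hotelling deflation, which may differ from the deflation-based procedure the authors propose.

```python
import numpy as np


def diagnostic_matrix(score_samples):
    """Monte Carlo estimate of H = E[s s^T] from an (N, n) array of score-ratio samples."""
    s = np.asarray(score_samples, dtype=float)
    return s.T @ s / s.shape[0]


def reduced_basis(H, rank):
    """Eigenvalues (descending) and the top-`rank` eigenvectors of a symmetric matrix H."""
    eigvals, eigvecs = np.linalg.eigh(H)   # eigh returns eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]      # reorder to descending
    return eigvals[order], eigvecs[:, order[:rank]]


def deflate(H, v):
    """Hotelling deflation: subtract the component of H along the direction v."""
    v = v / np.linalg.norm(v)
    return H - (v @ H @ v) * np.outer(v, v)


# Toy model (an assumption for this sketch): y = A x + noise, with x in R^10 and y in R^2.
rng = np.random.default_rng(0)
n, m, N, sigma = 10, 2, 2000, 0.1
A = rng.standard_normal((m, n))
x = rng.standard_normal((N, n))
y = x @ A.T + sigma * rng.standard_normal((N, m))

# Closed-form gradient of the Gaussian log-likelihood w.r.t. x: A^T (y - A x) / sigma^2.
# In the gradient-free setting, a learned score ratio model would supply these rows instead.
scores = (y - x @ A.T) @ A / sigma**2

H = diagnostic_matrix(scores)
eigvals, U = reduced_basis(H, rank=m)      # columns of U span the informed parameter subspace

# Deflating the leading eigenvector exposes the next direction as the new leading one.
H_deflated = deflate(H, U[:, 0])
```

In this toy problem every score sample lies in the row space of `A`, so `H` has rank 2 and the two leading eigenvectors recover that subspace. With a learned, noisy score ratio the spectrum would decay rather than cut off sharply, which is where the choice of rank and the accuracy of the leading eigenvectors start to matter.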

Dimension reduction via score ratio matching

High-dimensional Bayesian optimization using low-dimensional feature spaces

Conditional score-based diffusion models for Bayesian inference in infinite dimensions

Heavy-tailed denoising score matching

Exploring Dimension Learning Via a Penalized Probabilistic Principal Component Analysis

Optimal score estimation via empirical Bayes smoothing

Sparse sufficient dimension reduction using optimal scoring

An Adaptive Dimension Reduction Estimation Method for High-dimensional Bayesian Optimization

On efficient dimension reduction with respect to a statistical functional of interest

Reducing the cost of posterior sampling in linear inverse problems via task-dependent score learning

Batch, match, and patch: low-rank approximations for score-based variational inference

Proximal methods for sparse optimal scoring and discriminant analysis

Estimation of High-Dimensional Graphical Models Using Regularized Score Matching

Nonparametric Score Estimators

Neural Score Matching for High-Dimensional Causal Inference

Provable benefits of score matching

Coupled Input-Output Dimension Reduction: Application to Goal-oriented Bayesian Experimental Design and Global Sensitivity Analysis

Local logistic regression for dimension reduction in classification

SCORE: A 1D Reparameterization Technique to Break Bayesian Optimization's Curse of Dimensionality

Efficient Score Matching with Deep Equilibrium Layers

Sufficient Dimension Reduction Using Hilbert-Schmidt Independence Criterion