Structured prior distributions for the covariance matrix in latent factor models

Sarah Elizabeth Heaps,Ian Hyla Jermyn

DOI: https://doi.org/10.1007/s11222-024-10454-0

IF: 2.3241

2024-06-27

Statistics and Computing

Abstract:Factor models are widely used for dimension reduction in the analysis of multivariate data. This is achieved through decomposition of a covariance matrix into the sum of two components. Through a latent factor representation, they can be interpreted as a diagonal matrix of idiosyncratic variances and a shared variation matrix, that is, the product of a factor loadings matrix and its transpose. If , this defines a parsimonious factorisation of the covariance matrix. Historically, little attention has been paid to incorporating prior information in Bayesian analyses using factor models where, at best, the prior for the factor loadings is order invariant. In this work, a class of structured priors is developed that can encode ideas of dependence structure about the shared variation matrix. The construction allows data-informed shrinkage towards sensible parametric structures while also facilitating inference over the number of factors. Using an unconstrained reparameterisation of stationary vector autoregressions, the methodology is extended to stationary dynamic factor models. For computational inference, parameter-expanded Markov chain Monte Carlo samplers are proposed, including an efficient adaptive Gibbs sampler. Two substantive applications showcase the scope of the methodology and its inferential benefits.

statistics & probability,computer science, theory & methods

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is how to effectively combine prior information in the Bayesian factor model, especially the prior distribution encoding the dependence structure in the shared variation matrix. Specifically, the authors propose a class of structured prior distributions. These prior distributions can reflect prior beliefs about the dependence relationships among variables, while allowing data - driven shrinkage towards reasonable parameter structures and supporting the inference of the number of factors. In addition, by introducing an unconstrained re - parameterization method, this study also extends this method to the stationary dynamic factor model. This is the first general dynamic factor model prior and its associated inference scheme that restricts the inference within the stationary region without imposing additional restrictions. The paper also proposes efficient parameter - expanded Markov chain Monte Carlo (MCMC) samplers, including an efficient adaptive Gibbs sampler for computational inference. The key contribution of the paper lies in providing a flexible framework that can incorporate initial beliefs about the shared variation matrix into the prior of the factor loading matrix. By exploiting the algebraic relationship between the two, this method is more flexible than other methods described in the literature. This not only improves the interpretability of the model but also enhances the effectiveness of dimensionality reduction analysis for high - dimensional data.

Structured prior distributions for the covariance matrix in latent factor models

Blessing of dimension in Bayesian inference on covariance matrices

Sparse Bayesian factor analysis when the number of factors is unknown

Covariance Structure Estimation with Laplace Approximation

Covariance Function Estimation for High-Dimensional Functional Time Series with Dual Factor Structures

Incorporating graph information in Bayesian factor analysis with robust and adaptive shrinkage priors

Modeling the Cholesky factors of covariance matrices of multivariate longitudinal data

Bayesian Nonparametric Covariance Regression

Inferring Covariance Structure from Multiple Data Sources via Subspace Factor Analysis

Exponential Family Factors for Bayesian Factor Analysis

Fitting Multilevel Factor Models

Constrained Factor Models for High-Dimensional Matrix-Variate Time Series

Latent Factor Models for Density Estimation

Bayesian Multilevel Structural Equation Modeling: An Investigation into Robust Prior Distributions for the Doubly Latent Categorical Model

Factor-guided estimation of large covariance matrix function with conditional functional sparsity

Bayesian inference for a covariance matrix

Sparse factor models of high dimension

Block-diagonal idiosyncratic covariance estimation in high-dimensional factor models for financial time series

High-dimensional Factor Model and Its Applications to Statistical Machine Learning

High-Dimensional Conditional Covariance Matrices Estimation Using a Factor-GARCH Model