Empirical Bayes for Dynamic Bayesian Networks Using Generalized Variational Inference

Vyacheslav Kungurtsev,Apaar,Aarya Khandelwal,Parth Sandeep Rastogi,Bapi Chatterjee,Jakub Mareček
2024-06-29
Abstract:In this work, we demonstrate the Empirical Bayes approach to learning a Dynamic Bayesian Network. By starting with several point estimates of structure and weights, we can use a data-driven prior to subsequently obtain a model to quantify uncertainty. This approach uses a recent development of Generalized Variational Inference, and indicates the potential of sampling the uncertainty of a mixture of DAG structures as well as a parameter posterior.
Machine Learning,Statistics Theory
What problem does this paper attempt to address?
The paper aims to address the problem of structure learning in Dynamic Bayesian Networks (DBNs) under high-dimensional feature spaces and limited data samples. Specifically, the paper proposes a new framework that combines the empirical Bayesian method and Generalized Variational Inference (GVI) to tackle the following main issues: 1. **Structural Uncertainty**: Determining the exact structure of a DBN becomes impractical when data is insufficient. The paper proposes estimating structural uncertainty by computing a set of approximate, noisy graph structures and mixing these structures in a broader statistical process. 2. **Mixture Model Construction**: Directly constructing a mixture model in high-dimensional settings is challenging because suitable encoding methods become complex as the problem's dimensionality increases. By first finding a set of high-quality models as the basis for the mixture, Bayesian learning can be effectively performed, i.e., quantifying uncertainty in non-asymptotic sample sizes without incurring the potential computational overhead of encoding and sampling the entire DBN representation at once. 3. **Optimization and Sampling**: To simplify the sampling computation task in high-dimensional but small sample settings, the paper assumes a Linear Structural Equation Model (LSEM) and optimizes parameters and structure through Integer Programming (IP) and Generalized Variational Inference methods. In summary, the main objective of the paper is to develop a new method capable of effectively handling uncertainty and complexity in DBN structure learning on limited datasets, particularly in applications within high-dimensional feature spaces.