Doublethink: simultaneous Bayesian-frequentist model-averaged hypothesis testing

Helen R. Fryer,Nicolas Arning,Daniel J. Wilson
2024-07-27
Abstract:Establishing the frequentist properties of Bayesian approaches widens their appeal and offers new understanding. In hypothesis testing, Bayesian model averaging addresses the problem that conclusions are sensitive to variable selection. But Bayesian false discovery rate (FDR) guarantees are contingent on prior assumptions that may be disputed. Here we show that Bayesian model-averaged hypothesis testing is a closed testing procedure that controls the frequentist familywise error rate (FWER) in the strong sense. The rate converges pointwise as the sample size grows and, under some conditions, uniformly. The `Doublethink' method computes simultaneous posterior odds and asymptotic p-values for model-averaged hypothesis testing. We explore its benefits, including post-hoc variable selection, and limitations, including finite-sample inflation, through a Mendelian randomization study and simulations comparing approaches like LASSO, stepwise regression, the Benjamini-Hochberg procedure and e-values.
Methodology,Statistics Theory
What problem does this paper attempt to address?
The paper attempts to address the following issues: 1. **Bayesian Methods in Model Averaging Hypothesis Testing**: Bayesian model averaging addresses the sensitivity of conclusions due to variable selection in hypothesis testing. However, the Bayesian false discovery rate (FDR) guarantee relies on potentially controversial prior assumptions. 2. **Unification of Bayesian and Frequentist Methods**: The paper demonstrates that Bayesian model averaging hypothesis testing is a closed testing procedure that can control the frequentist family-wise error rate (FWER) in a strong sense. As the sample size increases, the error rate converges pointwise and, under certain conditions, uniformly. 3. **Doublethink Method**: A new method called "Doublethink" is proposed, which can simultaneously compute the posterior odds and asymptotic p-values for model averaging hypothesis testing. This method is compared with others (such as LASSO, stepwise regression, the Benjamini-Hochberg procedure, and e-values) through Mendelian randomization studies and simulations. In summary, the paper aims to enhance the robustness and interpretability of hypothesis testing by combining Bayesian and frequentist methods, especially in the context of large-scale datasets.