Weighted variance component test for the integrative multi-omics analysis of microbiome data

Angela Zhang, Wodan Ling, Amarise Little, Jessica S Williams-Nguyen, Jee-Young Moon, Robert D Burk, Rob Knight, Dong D Wang, Qibin Qi, Robert Kaplan, Ni Zhao, Michael Wu
DOI: https://doi.org/10.1101/2024.06.14.599073
2024-06-17
Abstract: Metabolic dysregulation and alterations have been linked to various diseases and conditions. Innovations in high-throughput technology now allow rapid profiling of the metabolome and metagenome (often the gene content of bacterial populations) for characterizing metabolism. Due to the small sample sizes and high dimensionality of the data, pathway analysis (wherein the effect of multiple genes or metabolites on an outcome is cumulatively assessed) of metabolomic data is commonly conducted and also represents a standard for metagenomic analysis. However, how to integrate both data types remains unclear. Recognizing that a metabolic pathway can be complementarily characterized by both metagenomics and metabolomics, we propose a weighted variance components framework to test if the joint effect of genes and metabolites in a biological pathway is associated with outcomes. The approach allows analytic p-value calculation, correlation between data types, and optimal weighting. Power simulations show that our approach often outperforms other strategies while maintaining type I error. The approach is illustrated on real data.
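To make the idea of a weighted variance component test concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes linear kernels on standardized features, an intercept-only null model for a continuous outcome, and a fixed, user-chosen weight `rho` between the gene kernel and the metabolite kernel. It also swaps in a permutation p-value in place of the analytic p-value calculation and optimal weighting described in the abstract. The names `linear_kernel`, `weighted_vc_test`, and `rho` are hypothetical and chosen for illustration only.

```python
import numpy as np


def linear_kernel(X):
    """Linear kernel on column-standardized features, scaled by feature count."""
    X = np.asarray(X, dtype=float)
    sd = X.std(axis=0)
    sd[sd == 0] = 1.0  # leave constant columns at zero after centering
    Z = (X - X.mean(axis=0)) / sd
    return Z @ Z.T / Z.shape[1]


def weighted_vc_test(y, genes, metabolites, rho=0.5, n_perm=999, seed=0):
    """Score-type statistic Q = r' K r with K = rho*K_genes + (1-rho)*K_metab.

    Assumptions (illustrative, not taken from the paper):
    - intercept-only null model, so r is the mean-centered outcome;
    - rho is fixed in [0, 1] rather than chosen optimally;
    - the p-value comes from permuting r, not from an analytic null distribution.
    """
    if not 0.0 <= rho <= 1.0:
        raise ValueError("rho must lie in [0, 1]")
    y = np.asarray(y, dtype=float)
    K = rho * linear_kernel(genes) + (1.0 - rho) * linear_kernel(metabolites)
    r = y - y.mean()
    q_obs = float(r @ K @ r)

    rng = np.random.default_rng(seed)
    exceed = 0
    for _ in range(n_perm):
        rp = rng.permutation(r)  # break the link between outcome and pathway
        if rp @ K @ rp >= q_obs:
            exceed += 1
    p_value = (1 + exceed) / (1 + n_perm)
    return q_obs, p_value


if __name__ == "__main__":
    # Simulated data: one gene and one metabolite in the pathway affect the outcome.
    rng = np.random.default_rng(1)
    n = 80
    G = rng.normal(size=(n, 10))  # hypothetical gene abundances
    M = rng.normal(size=(n, 5))   # hypothetical metabolite abundances
    y = 2.0 * G[:, 0] + 2.0 * M[:, 0] + rng.normal(scale=0.5, size=n)
    q, p = weighted_vc_test(y, G, M, rho=0.5)
    print(f"Q = {q:.1f}, permutation p-value = {p:.3f}")
```

Setting `rho=1` or `rho=0` reduces the sketch to a single-omics pathway test on genes or metabolites alone, which is one way to see what the joint weighting adds.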
Bioinformatics