Statistical and computational methods for integrating microbiome, host genomics, and metabolomics data

Rebecca A Deek,Siyuan Ma,James Lewis,Hongzhe Li
DOI: https://doi.org/10.7554/elife.88956
IF: 7.7
2024-06-05
eLife
Abstract:Large-scale microbiome studies are progressively utilizing multiomics designs, which include the collection of microbiome samples together with host genomics and metabolomics data. Despite the increasing number of data sources, there remains a bottleneck in understanding the relationships between different data modalities due to the limited number of statistical and computational methods for analyzing such data. Furthermore, little is known about the portability of general methods to the metagenomic setting and few specialized techniques have been developed. In this review, we summarize and implement some of the commonly used methods. We apply these methods to real data sets where shotgun metagenomic sequencing and metabolomics data are available for microbiome multiomics data integration analysis. We compare results across methods, highlight strengths and limitations of each, and discuss areas where statistical and computational innovation is needed.
biology
What problem does this paper attempt to address?