A Mixed‐Effect Kernel Machine Regression Model for Integrative Analysis of Alpha Diversity in Microbiome Studies

Runzhe Li,Mo Li,Ni Zhao
DOI: https://doi.org/10.1002/gepi.22596
2024-10-02
Genetic Epidemiology
Abstract:Increasing evidence suggests that human microbiota plays a crucial role in many diseases. Alpha diversity, a commonly used summary statistic that captures the richness and/or evenness of the microbial community, has been associated with many clinical conditions. However, individual studies that assess the association between alpha diversity and clinical conditions often provide inconsistent results due to insufficient sample size, heterogeneous study populations and technical variability. In practice, meta‐analysis tools have been applied to integrate data from multiple studies. However, these methods do not consider the heterogeneity caused by sequencing protocols, and the contribution of each study to the final model depends mainly on its sample size (or variance estimate). To combine studies with distinct sequencing protocols, a robust statistical framework for integrative analysis of microbiome datasets is needed. Here, we propose a mixed‐effect kernel machine regression model to assess the association of alpha diversity with a phenotype of interest. Our approach readily incorporates the study‐specific characteristics (including sequencing protocols) to allow for flexible modeling of microbiome effect via a kernel similarity matrix. Within the proposed framework, we provide three hypothesis testing approaches to answer different questions that are of interest to researchers. We evaluate the model performance through extensive simulations based on two distinct data generation mechanisms. We also apply our framework to data from HIV reanalysis consortium to investigate gut dysbiosis in HIV infection.
genetics & heredity,mathematical & computational biology
What problem does this paper attempt to address?