Causal Mediation Tree Model for Feature Identification on High-Dimensional Mediators

Yao Li, Wei Xu
DOI: https://doi.org/10.1007/s12561-023-09402-9
2023-01-01
Statistics in Biosciences
Abstract: High-dimensional mediation analysis plays an important role in recent biomedical research, as a large number of mediators, such as those in the microbiome, could modulate the effect of an exposure on the outcome of interest. Most current methods model mediators independently and do not consider the non-linear interactive effects among them. Furthermore, it can be challenging to identify features with mediation effects in a high-dimensional mediator space. We propose a non-parametric approach that builds causal mediation trees (CMT) to select important mediators and assess their non-linear interactive mediation effects on the outcome of the study. The data are recursively partitioned into subpopulations defined by the mediators with the largest mediation effects. With this approach, we aim to incorporate these non-linear interactions into the mediation framework and evaluate the total causal effect. Simulation studies were conducted to assess the performance of the CMT algorithm under different scenarios of interactive mediation effects. We applied the method to vaginal microbiome sequencing data from a study of reproductive-age women, investigating the causal relationship between ethnic group and vaginal pH level as mediated by the vaginal microbiome. We identified three important microbial taxa with strong mediation effects and estimated the total effect of the mediation tree model.
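The recursive partitioning idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' CMT algorithm: the abstract does not state the splitting criterion, so the sketch assumes a linear product-of-coefficients estimate (a times b) as the per-mediator mediation effect and a median cut point for each split. The names `mediation_effect`, `grow_tree`, `max_depth`, and `min_size` are hypothetical and chosen only for illustration.

```python
import numpy as np

def mediation_effect(x, m, y):
    """Assumed stand-in: product-of-coefficients estimate a*b for one mediator."""
    # a: slope from regressing the mediator on the exposure
    a = np.polyfit(x, m, 1)[0]
    # b: mediator coefficient in the outcome model, adjusting for the exposure
    design = np.column_stack([np.ones_like(x), x, m])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return a * coef[2]

def grow_tree(x, M, y, depth=0, max_depth=2, min_size=50):
    """Recursively split on the mediator with the largest absolute effect."""
    n, p = M.shape
    if depth >= max_depth or n < 2 * min_size:
        return {"leaf": True, "n": n}
    effects = np.array([mediation_effect(x, M[:, j], y) for j in range(p)])
    j = int(np.argmax(np.abs(effects)))
    # Assumption: split at the median of the selected mediator
    cut = float(np.median(M[:, j]))
    left = M[:, j] <= cut
    if left.sum() < min_size or (~left).sum() < min_size:
        return {"leaf": True, "n": n}
    return {
        "leaf": False,
        "mediator": j,
        "cut": cut,
        "effect": float(effects[j]),
        "left": grow_tree(x[left], M[left], y[left], depth + 1, max_depth, min_size),
        "right": grow_tree(x[~left], M[~left], y[~left], depth + 1, max_depth, min_size),
    }
```

Called as `grow_tree(x, M, y)` with a float exposure vector `x`, an `n` by `p` mediator matrix `M`, and an outcome vector `y`, the function returns a nested dictionary whose internal nodes record the selected mediator index and cut point. A non-parametric method such as the one proposed would replace the linear effect estimate used here.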