Cox-MKF: A Knockoff Filter for High-Dimensional Mediation Analysis with a Survival Outcome in Epigenetic Studies

Peixin Tian,Minhao Yao,Tao Huang,Zhonghua Liu
DOI: https://doi.org/10.1093/bioinformatics/btac687
IF: 5.8
2022-01-01
Bioinformatics
Abstract:Motivation It is of scientific interest to identify DNA methylation CpG sites that might mediate the effect of an environmental exposure on a survival outcome in high-dimensional mediation analysis. However, there is a lack of powerful statistical methods that can provide a guarantee of false discovery rate (FDR) control in finite-sample settings. Results In this article, we propose a novel method called CoxMKF, which applies aggregation of multiple knockoffs to a Cox proportional hazards model for a survival outcome with high-dimensional mediators. The proposed CoxMKF can achieve FDR control even in finite-sample settings, which is particularly advantageous when the sample size is not large. Moreover, our proposed CoxMKF can overcome the randomness of the unstable model-X knockoffs. Our simulation results show that CoxMKF controls FDR well in finite samples. We further apply CoxMKF to a lung cancer dataset from The Cancer Genome Atlas (TCGA) project with 754 subjects and 365 306 DNA methylation CpG sites, and identify four DNA methylation CpG sites that might mediate the effect of smoking on the overall survival among lung cancer patients. Availability and implementation: The R package CoxMKF is publicly available at https://github.com/MinhaoYaooo/CoxMKF. Contact: zl2509@cumc.columbia.edu Supplementary information: Supplementary data are available at Bioinformatics online.
What problem does this paper attempt to address?