Reconstructing Molecular Networks by Causal Diffusion Do‐Calculus Analysis with Deep Learning
Jiachen Wang,Yuelei Zhang,Luonan Chen,Xiaoping Liu
DOI: https://doi.org/10.1002/advs.202409170
IF: 15.1
2024-10-25
Advanced Science
Abstract:Causal Diffusion Do‐calculus (CDD) is a new method that combines diffusion models with do‐calculus to infer causality between observed variables. Compared with Granger causality (GC) and Mendelian randomization (MR), CDD has a broader range of applications and can infer causal relationships only based on observed data. Furthermore, unlike traditional biological network inference methods such as GENIE3, CDD is grounded in causal theory for its causal inference. Quantifying molecular regulations between genes/molecules causally from observed data is crucial for elucidating the molecular mechanisms underlying biological processes at the network level. Presently, most methods for inferring gene regulatory and biological networks rely on association studies or observational causal‐analysis approaches. This study introduces a novel approach that combines intervention operations and diffusion models within a do‐calculus framework by deep learning, i.e., Causal Diffusion Do‐calculus (CDD) analysis, to infer causal networks between molecules. CDD can extract causal relations from observed data owing to its intervention operations, thereby significantly enhancing the accuracy and generalizability of causal network inference. Computationally, CDD has been applied to both simulated data and real omics data, which demonstrates that CDD outperforms existing methods in accurately inferring gene regulatory networks and identifying causal links from genes to disease phenotypes. Especially, compared with the Mendelian randomization algorithm and other existing methods, the CDD can reliably identify the disease genes or molecules for complex diseases with better performances. In addition, the causal analysis between various diseases and the potential factors in different populations from the UK Biobank database is also conducted, which further validated the effectiveness of CDD.
materials science, multidisciplinary,nanoscience & nanotechnology,chemistry