Abstract:The mainstream of research in genetics, epigenetics, and imaging data analysis focuses on statistical association or exploring statistical dependence between variables. Despite their significant progresses in genetic research, understanding the etiology and mechanism of complex phenotypes remains elusive. Using association analysis as a major analytical platform for the complex data analysis is a key issue that hampers the theoretic development of genomic science and its application in practice. Causal inference is an essential component for the discovery of mechanical relationships among complex phenotypes. Many researchers suggest making the transition from association to causation. Despite its fundamental role in science, engineering, and biomedicine, the traditional methods for causal inference require at least three variables. However, quantitative genetic analysis such as QTL, eQTL, mQTL, and genomic-imaging data analysis requires exploring the causal relationships between two variables. This paper will focus on bivariate causal discovery with continuous variables. We will introduce independence of cause and mechanism (ICM) as a basic principle for causal inference, algorithmic information theory and additive noise model (ANM) as major tools for bivariate causal discovery. Large-scale simulations will be performed to evaluate the feasibility of the ANM for bivariate causal discovery. To further evaluate their performance for causal inference, the ANM will be applied to the construction of gene regulatory networks. Also, the ANM will be applied to trait-imaging data analysis to illustrate three scenarios: presence of both causation and association, presence of association while absence of causation, and presence of causation, while lack of association between two variables. Telling cause from effect between two continuous variables from observational data is one of the fundamental and challenging problems in omics and imaging data analysis. Our preliminary simulations and real data analysis will show that the ANMs will be one of choice for bivariate causal discovery in genomic and imaging data analysis.

Causal discovery using compression-complexity measures

Casual Compressive Sensing for Gene Network Inference

Learning Causal Structures Based on Divide and Conquer

Causal Discovery and Classification Using Lempel-Ziv Complexity

Causality Network of Infectious Disease Revealed with Causal Decomposition

A Genetic Algorithm for Causal Discovery Based on Structural Causal Model.

Learning domain-specific causal discovery from time series

Normalized multivariate time series causality analysis and causal graph reconstruction

Reinterpreting causal discovery as the task of predicting unobserved joint statistics

Recursively Learning Causal Structures Using Regression-based Conditional Independence Test

Leveraging Structured Biological Knowledge for Counterfactual Inference: A Case Study of Viral Pathogenesis

Inferring Causality from Time Series data based on Structural Causal Model and its application to Neural Connectomics

Implicit Causal Models for Genome-wide Association Studies

Bivariate Causal Discovery and Its Applications to Gene Expression and Imaging Data Analysis

Causality is all you need

Assessing the Effectiveness of Causality Inference Methods for Gene Regulatory Networks

Learning Causality: Synthesis of Large-Scale Causal Networks from High-Dimensional Time Series Data

An Robust N-Gram Causal Inference Approach Based on Multi-model Fusion

Identification and quantification of Granger causality between gene sets

Causal Discovery by Kernel Deviance Measures with Heterogeneous Transforms

EffCause: Discover Dynamic Causal Relationships Efficiently from Time-Series