Abstract: Expression Quantitative Trait Locus (eQTL) analysis is a powerful tool for studying the biological mechanisms that link genotype to gene expression. Such analyses can identify genomic locations where genotypic variants influence the expression of genes, both in close proximity to the variant (cis-eQTL) and on other chromosomes (trans-eQTL). Many traditional eQTL methods are based on a linear regression model. In this study, we propose a novel method for identifying eQTL associations using information theory and machine learning approaches. Mutual Information (MI) is used to describe the association between a genetic marker and gene expression. MI can detect both linear and non-linear associations. Moreover, it can capture the heterogeneity of the population. Advanced feature selection methods, Maximum Relevance Minimum Redundancy (mRMR) and Incremental Feature Selection (IFS), were applied to optimize the selection of the genes affected by each genetic marker. When we applied our method to a study of apoE-deficient mice, we found that cis-acting eQTLs are stronger than trans-acting eQTLs, but that trans-acting eQTLs are more numerous than cis-acting eQTLs. We compared our results (mRMR.eQTL) with those of R/qtl and MatrixEQTL (modelLINEAR and modelANOVA). In female mice, 67.9% of mRMR.eQTL results can be confirmed by at least two other methods, while only 14.4% of R/qtl results can be confirmed by at least two other methods. In male mice, 74.1% of mRMR.eQTL results can be confirmed by at least two other methods, while only 18.2% of R/qtl results can be confirmed by at least two other methods. Our method provides a new way to identify associations between genetic markers and gene expression. Our software is available in the supporting information.
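
The idea of scoring marker-gene associations with MI and then ranking genes by mRMR can be illustrated with a minimal sketch. This is not the authors' mRMR.eQTL software: the function names (`discretize`, `mutual_information`, `mrmr_rank`), the equal-frequency binning into three levels, the difference form of the mRMR score, and the simulated data are all assumptions made for illustration. The IFS step, which would evaluate a classifier on growing prefixes of the ranked list, is omitted.

```python
import numpy as np

def discretize(x, bins=3):
    """Equal-frequency binning of a continuous vector (assumed scheme)."""
    edges = np.quantile(x, np.linspace(0, 1, bins + 1)[1:-1])
    return np.digitize(x, edges)

def mutual_information(a, b):
    """MI in nats between two discrete label vectors of equal length."""
    _, ai = np.unique(np.asarray(a).ravel(), return_inverse=True)
    _, bi = np.unique(np.asarray(b).ravel(), return_inverse=True)
    joint = np.zeros((ai.max() + 1, bi.max() + 1))
    np.add.at(joint, (ai, bi), 1)          # contingency table of counts
    joint /= joint.sum()                   # joint probabilities
    pa = joint.sum(axis=1, keepdims=True)  # marginal of a, shape (k, 1)
    pb = joint.sum(axis=0, keepdims=True)  # marginal of b, shape (1, m)
    nz = joint > 0                         # skip empty cells (0 * log 0 = 0)
    return float(np.sum(joint[nz] * np.log(joint[nz] / (pa @ pb)[nz])))

def mrmr_rank(expr, genotype, n_select, bins=3):
    """Greedy mRMR ranking of genes (columns of expr) for one marker.

    Score = MI(gene, genotype) - mean MI(gene, already selected genes).
    """
    n_genes = expr.shape[1]
    disc = np.column_stack([discretize(expr[:, j], bins) for j in range(n_genes)])
    relevance = np.array([mutual_information(disc[:, j], genotype)
                          for j in range(n_genes)])
    selected = [int(np.argmax(relevance))]
    while len(selected) < min(n_select, n_genes):
        best, best_score = None, -np.inf
        for j in range(n_genes):
            if j in selected:
                continue
            redundancy = np.mean([mutual_information(disc[:, j], disc[:, s])
                                  for s in selected])
            score = relevance[j] - redundancy
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
    return selected, relevance

# Simulated data (hypothetical): 300 samples, one marker, three genes.
rng = np.random.default_rng(0)
n = 300
genotype = rng.integers(0, 3, size=n)                 # coded 0, 1, 2
linear_gene = genotype + 0.3 * rng.normal(size=n)     # additive effect
nonlinear_gene = 2.0 * (genotype == 1) + 0.3 * rng.normal(size=n)  # heterozygote-only effect
noise_gene = rng.normal(size=n)                       # no association
expr = np.column_stack([linear_gene, nonlinear_gene, noise_gene])

selected, relevance = mrmr_rank(expr, genotype, n_select=3)
pearson_r = np.corrcoef(genotype, nonlinear_gene)[0, 1]
print("MI relevance per gene:", np.round(relevance, 3))
print("Pearson r, genotype vs. nonlinear gene:", round(float(pearson_r), 3))
print("mRMR order:", selected)
```

In this simulation the second gene responds only in heterozygotes, so a straight-line fit against the genotype code has little to work with, while its MI with the genotype should remain well above that of the noise gene. That is the kind of non-linear association the abstract says MI can detect.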

Statistical and Machine Learning Methods for eQTL Analysis

An Information-Theoretic Machine Learning Approach to Expression QTL Analysis

Integrating genetic and gene expression data: methods and applications of eQTL mapping

Distributed eQTL Analysis with Auxiliary Information

Adaptive Multi-Task Lasso: with Application to eQTL Detection.

eQTL Mapping via Effective SNP Ranking and Screening

The Single-Cell eQTLGen Consortium

Single-cell eQTLGen Consortium: a Personalized Understanding of Disease

Discovering eQTL Regulatory Patterns Through eQTLMotif

Sparse Regression Models for Unraveling Group and Individual Associations in eQTL Mapping

Quantile regression for challenging cases of eQTL mapping

Expression Quantitative Trait Loci (eQTL) Analysis in Cancer.

HT-eQTL: Integrative Expression Quantitative Trait Loci Analysis in a Large Number of Human Tissues

A spectral framework to map QTLs affecting joint differential networks of gene co-expression

Statistical Approaches in QTL Mapping and Molecular Breeding for Complex Traits

Statistical Learning Methods For Genome-based Analysis Of Quantitative Traits

Estimation of Interpretable eQTL Effect Sizes Using a Log of Linear Model

Gene Set Enrichment in eQTL Data Identifies Novel Annotations and Pathway Regulators.

Expression Quantitative Trait Loci Analysis in Plants

A Robust Statistical Method For Association-Based eQTL Analysis

Network-based group variable selection for detecting expression quantitative trait loci (eQTL)