Statistical and integrative system-level analysis of DNA methylation data

Andrew E. Teschendorff,Caroline L. Relton
DOI: https://doi.org/10.1038/nrg.2017.86
IF: 59.581
2017-11-13
Nature Reviews Genetics
Abstract:Key PointsCell-type heterogeneity can be a major source of confounding and reverse causation in epigenome-wide association studies (EWAS). Adjustment for cell-type composition is therefore critical for an improved interpretation and understanding of EWAS.For a given study, the best choice of cell-type deconvolution algorithm depends not only on the tissue and phenotype of interest but also on the presence of other confounders and the desired output.Most variation in DNA methylation (DNAm) is driven by genetic factors and cell-type heterogeneity, with corresponding features — methylation quantitative trait loci (mQTLs) and cell-type-specific differentially methylated cytosines (DMCs) — readily identifiable using linear modelling.Identification and interpretation of DNAm changes that accrue with age or exposure to environmental disease risk factors may benefit from differential variance statistics.Analysing patterns of covariation in DNAm at regulatory elements can help to identify disrupted regulatory networks and gene modules in disease.The inverse association between DNAm at regulatory elements and transcription factor binding can be exploited to elucidate the functional role of non-coding genome-wide association study (GWAS) single-nucleotide polymorphisms (SNPs) or functional effects caused by exposure to environmental disease risk factors.Mendelian randomization can help to clarify the role of DNAm as a causal mediator between exposure to risk factors and disease.
genetics & heredity
What problem does this paper attempt to address?