Inferring gene regulatory networks using DNA methylation data

Thomas E Bartlett, Melodie Li, Qiulin Huang
DOI: https://doi.org/10.1101/2024.04.23.590858
2024-06-24
Abstract: We show much-improved accuracy of inference of GRN (gene regulatory network) structure, resulting from the use of an epigenomic prior network. We also find that DNAme data are very effective for inferring the epigenomic prior network, recapitulating known epigenomic network structure found previously from chromatin accessibility data, and typically providing potential TF cis-regulations for at least eight times as many genes when compared with chromatin accessibility data. When our proposed methodology is applied to real datasets from human embryonic development and from women at risk of breast cancer, we find patterns of differential cis-regulation that are in line with expectations under appropriate biological models, and that can be used to identify pre-cancerous epigenomic changes with valid functional genomic interpretations.
Genomics
What problem does this paper attempt to address?
The paper aims to address the following key issues:

1. **Improving the accuracy of Gene Regulatory Network (GRN) inference**: By utilizing epigenomic prior networks (specifically, prior networks inferred from DNA methylation data), the researchers aim to improve methods for inferring GRNs from gene expression data.
2. **Validating the effectiveness of DNA methylation data in inferring epigenomic prior networks**: The paper demonstrates that DNA methylation data can be used effectively to infer epigenomic prior networks, and that the results are consistent with those previously obtained from chromatin accessibility data.
3. **Application to real datasets related to human embryonic development and breast cancer risk**: By applying the proposed method to real datasets from human embryonic development and from women at risk of breast cancer, the researchers aim to identify differential cis-regulatory patterns associated with cell fate determination and pre-cancerous epigenomic changes.

Specifically, the goal of the paper is to develop a new method for inferring the structure of GRNs in specific cell types, combining advanced regression techniques with epigenomic data (such as DNA methylation). The method first uses epigenomic data to infer a "prior network" and then refines this network using gene expression data. The advantage of this approach is that it significantly reduces the number of potential regulators considered for each gene, thereby improving the accuracy and efficiency of GRN inference. The method has also been applied to real datasets from human embryonic development and breast cancer risk assessment, to reveal mechanisms of cell fate determination and early epigenetic changes in cancer.
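As a rough illustration of the two-step idea described above, the following Python sketch restricts each target gene's candidate regulators to those listed in a prior network, then fits a regression on expression data to refine the edges. Ridge regression is used here purely as a stand-in: this summary does not specify the paper's regression technique or how the prior is derived from DNAme data. The function name `refine_prior_network`, the `prior` dictionary, the `ridge` and `threshold` parameters, and the simulated data are all hypothetical.

```python
import numpy as np


def refine_prior_network(expr, genes, prior, ridge=1e-2, threshold=0.0):
    """Refine a prior network with expression data (illustrative sketch only).

    expr      : (n_cells, n_genes) expression matrix
    genes     : list of gene names, one per column of expr
    prior     : dict mapping target gene -> set of candidate TF names
                (assumed here to come from an epigenomic prior network)
    ridge     : ridge penalty; a stand-in, not the paper's regression method
    threshold : keep an edge only if |coefficient| >= threshold
    Returns a dict mapping (tf, target) -> fitted coefficient.
    """
    col = {g: i for i, g in enumerate(genes)}
    edges = {}
    for target, candidates in prior.items():
        # Only TFs allowed by the prior are considered as regulators.
        tfs = sorted(tf for tf in candidates if tf in col and tf != target)
        if target not in col or not tfs:
            continue
        X = expr[:, [col[tf] for tf in tfs]]
        y = expr[:, col[target]]
        # Centre the data so that no intercept term is needed.
        X = X - X.mean(axis=0)
        y = y - y.mean()
        # Closed-form ridge solution: (X'X + ridge * I)^-1 X'y
        coef = np.linalg.solve(X.T @ X + ridge * np.eye(len(tfs)), X.T @ y)
        for tf, c in zip(tfs, coef):
            if abs(c) >= threshold:
                edges[(tf, target)] = float(c)
    return edges


# Simulated toy data: G1 is driven by TF1 only.
rng = np.random.default_rng(0)
genes = ["TF1", "TF2", "TF3", "G1"]
tf_expr = rng.normal(size=(200, 3))
g1 = 2.0 * tf_expr[:, 0] + 0.1 * rng.normal(size=200)
expr = np.column_stack([tf_expr, g1])

# Hypothetical prior: only TF1 and TF2 are candidate regulators of G1.
prior = {"G1": {"TF1", "TF2"}}
edges = refine_prior_network(expr, genes, prior, threshold=0.5)
print(edges)
```

In this toy setup, TF3 is never tested as a regulator of G1 because the prior excludes it, which is the sense in which a prior network shrinks the search space before any regression is run.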