CatLearning: highly accurate gene expression prediction from histone mark

Weining Lu,Yin Tang,Yu Liu,Shiyi Lin,Qifan Shuai,Bin Liang,Rongqing Zhang,Yu Cheng,Dong Fang
DOI: https://doi.org/10.1093/bib/bbae373
IF: 9.5
2024-07-25
Briefings in Bioinformatics
Abstract:Abstract Histone modifications, known as histone marks, are pivotal in regulating gene expression within cells. The vast array of potential combinations of histone marks presents a considerable challenge in decoding the regulatory mechanisms solely through biological experimental approaches. To overcome this challenge, we have developed a method called CatLearning. It utilizes a modified convolutional neural network architecture with a specialized adaptation Residual Network to quantitatively interpret histone marks and predict gene expression. This architecture integrates long-range histone information up to 500Kb and learns chromatin interaction features without 3D information. By using only one histone mark, CatLearning achieves a high level of accuracy. Furthermore, CatLearning predicts gene expression by simulating changes in histone modifications at enhancers and throughout the genome. These findings help comprehend the architecture of histone marks and develop diagnostic and therapeutic targets for diseases with epigenetic changes.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?