Identifying Modifications on DNA-bound Histones with Joint Deep Learning of Multiple Binding Sites in DNA Sequence

Yan Li,Lijun Quan,Yiting Zhou,Yelu Jiang,Kailong Li,Tingfang Wu,Qiang Lyu
DOI: https://doi.org/10.1093/bioinformatics/btac489
IF: 5.8
2022-01-01
Bioinformatics
Abstract:Motivation: Histone modifications are epigenetic markers that impact gene expression by altering the chromatin structure or recruiting histone modifiers. Their accurate identification is key to unraveling the mechanisms by which they regulate gene expression. However, the solutions for this task can be improved by exploiting multiple relationships from dataset and exploring designs of learning models, for example jointly learning technology. Results: This article proposes a deep learning-based multi-objective computational approach, iHMnBS, to identify which of the seven typical histone modifications a DNA sequence may choose to bind, and which parts of the DNA sequence bind to them. iHMnBS employs a customized dataset that allows the marking of modifications contained in histones that may bind to any position in the DNA sequence. iHMnBS tries to mine the information implicit in this richer data by means of deep neural networks. In comprehensive comparisons, iHMnBS outperforms a baseline method, and the probability of binding to modified histones assigned to a representative nucleotide of a DNA sequence can serve as a reference for biological experiments. Since the interaction between transcription factors and histone modifications has an important role in gene expression, we extracted a number of sequence patterns that may bind to transcription factors, and explored their possible impact on disease.
What problem does this paper attempt to address?