SDBA: Score Domain-Based Attention for DNA N4-Methylcytosine Site Prediction from Multiperspectives

Ruihao Xin, Fan Zhang, Jiaxin Zheng, Yangyi Zhang, Cuinan Yu, Xin Feng
DOI: https://doi.org/10.1021/acs.jcim.3c00688
IF: 6.162
2023-08-30
Journal of Chemical Information and Modeling
Abstract: In tasks related to DNA sequence classification, choosing an appropriate encoding method is challenging. Some methods encode sequences based on prior knowledge, which limits the model's ability to obtain multiperspective information from the sequences. We introduced SDBA (Score Domain-Based Attention), a new trainable ensemble method based on the attention mechanism. Unlike other methods, we fed task-independent encoding results into the models and dynamically ensembled features from different perspectives using the SDBA mechanism. This approach allows the model to acquire and weight sequence features on its own rather than through hand-chosen encodings. SDBA is conceptually general and empirically powerful, and it achieved new state-of-the-art results on the benchmark data sets associated with DNA N4-methylcytosine site prediction.
chemistry, multidisciplinary, medicinal, computer science, interdisciplinary applications, information systems
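
The abstract describes feeding task-independent encodings into models and then weighting the resulting features from different perspectives with attention. The sketch below illustrates that general idea only: it is a generic softmax-attention fusion, not the SDBA mechanism as defined in the paper, whose details are not given here. The function names (`one_hot_encode`, `attention_ensemble`) and the scoring vector `w` are hypothetical, introduced purely for illustration.

```python
import numpy as np


def one_hot_encode(seq):
    """Task-independent encoding: one row per base, columns ordered A, C, G, T."""
    alphabet = "ACGT"
    encoded = np.zeros((len(seq), len(alphabet)))
    for i, base in enumerate(seq):
        encoded[i, alphabet.index(base)] = 1.0
    return encoded


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    shifted = np.exp(x - np.max(x))
    return shifted / shifted.sum()


def attention_ensemble(features, w):
    """Fuse K perspective feature vectors with attention weights.

    features: array of shape (K, d), one feature vector per perspective
              (assumed here to come from K separate sub-models).
    w:        array of shape (d,), a hypothetical trainable scoring vector.
    Returns the fused (d,) vector and the (K,) attention weights.
    """
    scores = features @ w          # one scalar score per perspective
    weights = softmax(scores)      # normalize scores so they sum to 1
    fused = weights @ features     # weighted sum of perspective features
    return fused, weights
```

In a trained model `w` would be learned jointly with the sub-models, so the weight given to each perspective can differ from one input sequence to the next; here it is simply passed in as an argument.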