A deep learning based two-layer predictor to identify enhancers and their strength

Di Zhu, Wen Yang, Dali Xu, Hongfei Li, Yuming Zhao, Dan Li
DOI: https://doi.org/10.1016/j.ymeth.2023.01.007
IF: 4.647
2023-03-01
Methods
Abstract: An enhancer is a DNA sequence that increases the activity of promoters and thereby raises the frequency of gene transcription, so it plays an essential role in activating gene expression. Over the past 30 years, gene sequencing technology has advanced from the first generation to the third, and the volume of biological sequence data has grown significantly every year. Enhancers are functionally important, but identifying them through biochemical experiments is very expensive. Therefore, new methods are needed for the identification and classification of enhancers. Based on the K-mer principle, this study proposes a feature extraction method that has not previously been used with convolutional neural networks. We then combine it with one-hot encoding to build an efficient one-dimensional convolutional neural network ensemble model for predicting enhancers and their strengths. Finally, we compare our model with the models proposed by other researchers using five commonly used classification evaluation metrics. On the same independent test dataset used by the other models, the proposed model performs better.
biochemistry & molecular biology, biochemical research methods
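
The abstract names two sequence encodings, one-hot encoding and a K-mer-based feature, but does not give their details. The following is a minimal sketch of both under stated assumptions: the K-mer feature is taken to be a plain normalized k-mer frequency vector, which may differ from the feature extraction method the authors actually propose, and the function names `one_hot_encode` and `kmer_frequencies` as well as the default `k=3` are hypothetical choices made for illustration.

```python
from itertools import product

BASES = "ACGT"


def one_hot_encode(seq):
    """Return a len(seq) x 4 matrix of 0/1 values (columns: A, C, G, T).

    Assumption: bases outside ACGT (for example N) become all-zero rows.
    """
    index = {base: i for i, base in enumerate(BASES)}
    matrix = []
    for base in seq.upper():
        row = [0, 0, 0, 0]
        if base in index:
            row[index[base]] = 1
        matrix.append(row)
    return matrix


def kmer_frequencies(seq, k=3):
    """Return normalized counts of all 4**k k-mers in lexicographic order.

    Assumption: a simple sliding-window frequency vector stands in for the
    paper's K-mer-based feature, whose exact definition is not in the abstract.
    Windows containing bases outside ACGT are skipped.
    """
    kmers = ["".join(p) for p in product(BASES, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    seq = seq.upper()
    total = 0
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:
            counts[kmer] += 1
            total += 1
    return [counts[m] / total if total else 0.0 for m in kmers]
```

A one-hot matrix preserves the position of every base and is a natural input for a one-dimensional convolution, while a k-mer frequency vector summarizes local composition in a fixed length of 4**k values regardless of sequence length. How the two are combined inside the ensemble model is not stated in the abstract.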