Protein Remote Homology Detection Based on Deep Convolutional Neural Network

Yu Wang,JunPeng Bao,Fangfang Huang,Jianqiang Du,Yongfeng Li
DOI: https://doi.org/10.21203/rs.2.15388/v1
2019-01-01
Abstract:Abstract Background: Protein remote homology detection has long received great attention in the field of bioinformatics, but the property of low protein sequence similarities heavily influences the accuracy of detection. Recently, such deep learning methods as LSTM have been adopted to deal with the problem. However, LSTM-based models will consume much time during the training process because of their cyclic connection mechanism and such problem will become more serious when dealing with long protein sequences. Results: In this paper, we propose a CNN-based network, called ConvRes, to address the aforementioned shortcomings of existing methods in this field, which combines a variant Inception and Resnet block. Experimental results show that (1) this CNN-based network can classify the family of the remote homology proteins with comparable precision to the existing state-of-art method (ProDec-BLSTM) on the SCOP benchmark dataset. (2) ConvRes consumes less time with using only 15000 seconds whereas ProDec-BLSTM taking 150000 seconds. Conclusion: This paper showcases that our proposed ConvRes network outperforms other existing models with regard to detecting remote homology proteins. The experimental results prove ConvRes network to be a viable and efficient model for remote homology protein detection. In the future work, we will improve the performance of ConvRes network by using other dataset and explore new representations to adapt for variable-length sequences.
What problem does this paper attempt to address?