DeepFrag-k: a Fragment-Based Deep Learning Approach for Protein Fold Recognition.

Elhefnawy Wessam,Li Min,Wang Jianxin,Li Yaohang
DOI: https://doi.org/10.1186/s12859-020-3504-z
IF: 3.307
2020-01-01
BMC Bioinformatics
Abstract:BACKGROUND:One of the most essential problems in structural bioinformatics is protein fold recognition. In this paper, we design a novel deep learning architecture, so-called DeepFrag-k, which identifies fold discriminative features at fragment level to improve the accuracy of protein fold recognition. DeepFrag-k is composed of two stages: the first stage employs a multi-modal Deep Belief Network (DBN) to predict the potential structural fragments given a sequence, represented as a fragment vector, and then the second stage uses a deep convolutional neural network (CNN) to classify the fragment vector into the corresponding fold.RESULTS:Our results show that DeepFrag-k yields 92.98% accuracy in predicting the top-100 most popular fragments, which can be used to generate discriminative fragment feature vectors to improve protein fold recognition.CONCLUSIONS:There is a set of fragments that can serve as structural "keywords" distinguishing between major protein folds. The deep learning architecture in DeepFrag-k is able to accurately identify these fragments as structure features to improve protein fold recognition.
What problem does this paper attempt to address?