A Deep Aggregated Model for Protein Secondary Structure Prediction

Yu Hu,Tiezheng Nie,Derong Shen,Ge Yu
DOI: https://doi.org/10.1504/ijdmb.2019.10022331
2019-01-01
International Journal of Data Mining and Bioinformatics
Abstract:Protein sequence analysis is an important research subject that has drawn increasing attention from biomedical researchers. In this research field, Protein Secondary Structure Predication (PSSP) is a significant sub-project for studying protein spatial structure and biochemical function. However, when only the amino acid residues sequence information can be used as the input, it is a challenge problem to predict the spatial structure of the protein. Recently, the deep learning technology achieves great success in information mining. In this paper, we propose a Deep Neural Block Cascade Network (DeepNBCN) for Protein Secondary Structure Predication. This model is constructed by stacking multiple free-adjusted blocks, each for aggregating Feature Extractor Module and Concate and Activate (C&A) Module. The homogeneous and multi-branch architecture can model the complex internal relationship between amino acid sequence and protein secondary structure sequence. We use two publicly available protein datasets to evaluate the proposed model. Experimental results show that our model can obtain 85% Q 3 accuracy, 86% SOV score, and 75% Q 8 accuracy, respectively, achieving better performance compared with the currently popular predictors.
What problem does this paper attempt to address?