Protein Secondary Structure Prediction: A Review of Progress and Directions

Tomasz Smolarczyk, Irena Roterman-Konieczna, Katarzyna Stapor
DOI: https://doi.org/10.2174/1574893614666191017104639
2020-03-10
Current Bioinformatics
Abstract: Background: Over the last few decades, the search for a theory of protein folding has grown into a full-fledged research field at the intersection of biology, chemistry and informatics. Despite enormous effort, open questions and challenges remain, such as understanding the rules by which an amino acid sequence determines protein secondary structure. Objective: In this review, we trace the progress of prediction methods over the years and identify the sources of improvement. Methods: We describe the protein secondary structure prediction problem, then discuss its theoretical limitations, the commonly used data sets and features, and three generations of methods, with a focus on the most recent advances. Additionally, methods with available online servers are assessed on an independent data set. Results: State-of-the-art methods currently reach almost 88% accuracy for 3-class prediction and 76.5% for 8-class prediction. Conclusion: This review summarizes recent advances and outlines further research directions.
biochemical research methods, mathematical & computational biology