Abstract:Predicting protein structure is both fascinating and formidable, playing a crucial role in structure-based drug discovery and unraveling diseases with elusive origins. The Critical Assessment of Protein Structure Prediction (CASP) serves as a biannual battleground where global scientists converge to untangle the intricate relationships within amino acid chains. Two primary methods, Template-Based Modeling (TBM) and Template-Free (TF) strategies, dominate protein structure prediction. The trend has shifted towards Template-Free predictions due to their broader sequence coverage with fewer templates. The predictive process can be broadly classified into contact map, binned-distance, and real-valued distance predictions, each with distinctive strengths and limitations manifested through tailored loss functions. We have also introduced revolutionary end-to-end, and all-atom diffusion-based techniques that have transformed protein structure predictions. Recent advancements in deep learning techniques have significantly improved prediction accuracy, although the effectiveness is contingent upon the quality of input features derived from natural bio-physiochemical attributes and Multiple Sequence Alignments (MSA). Hence, the generation of high-quality MSA data holds paramount importance in harnessing informative input features for enhanced prediction outcomes. Remarkable successes have been achieved in protein structure prediction accuracy, however not enough for what structural knowledge was intended to, which implies need for development in some other aspects of the predictions. In this regard, scientists have opened other frontiers for protein structural prediction. The utilization of subsampling in multiple sequence alignment (MSA) and protein language modeling appears to be particularly promising in enhancing the accuracy and efficiency of predictions, ultimately aiding in drug discovery efforts. The exploration of predicting protein complex structure also opens up exciting opportunities to deepen our knowledge of molecular interactions and design therapeutics that are more effective. In this article, we have discussed the vicissitudes that the scientists have gone through to improve prediction accuracy, and examined the effective policies in predicting from different aspects, including the construction of high quality MSA, providing informative input features, and progresses in deep learning approaches. We have also briefly touched upon transitioning from predicting single-chain protein structures to predicting protein complex structures. Our findings point towards promoting open research environments to support the objectives of protein structure prediction.

Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms

Protein Structure Prediction: Conventional and Deep Learning Perspectives

Recent Advances and Challenges in Protein Structure Prediction

AI-Driven Deep Learning Techniques in Protein Structure Prediction

Recent Progress of Protein Tertiary Structure Prediction

A Review of Protein Structure Prediction using Deep Learning

Unveiling the evolution of policies for enhancing protein structure predictions: A comprehensive analysis

Recent developments in deep learning applied to protein structure prediction

Deep Learning in Protein Structural Modeling and Design

Deep Learning-Based Advances in Protein Structure Prediction

AI told you so: navigating protein structure prediction in the era of machine learning

Advances in protein structure prediction and design

Structure-based protein design with deep learning

Deep learning techniques have significantly impacted protein structure prediction and protein design

Deep learning for protein structure prediction and design—progress and applications

Improved protein structure prediction using potentials from deep learning

Advances of Deep Learning in Protein Science: A Comprehensive Survey

A glance into the evolution of template-free protein structure prediction methodologies

Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition

Protein Language Models and Structure Prediction: Connection and Progression

Recent Applications of Deep Learning Methods on Evolution- and Contact-Based Protein Structure Prediction