AFsample2: Predicting multiple conformations and ensembles with AlphaFold2

Yogesh Kalakoti,Björn Wallner
DOI: https://doi.org/10.1101/2024.05.28.596195
2024-06-02
Abstract:Understanding protein dynamics and conformational states carries profound scientific and practical implications for several areas of research, ranging from a general understanding of biological processes at the molecular level to a detailed understanding of disease mechanisms, which in turn can open up new avenues in drug development. Multiple solutions have been recently developed to widen the conformational landscape of predictions made by Alphafold2 (AF2). Here, we introduce AFsample2, a method employing random MSA column masking to reduce the influence of co-evolutionary signals to enhance the structural diversity of models generated by the AF2 neural network. AFsample2 improves the prediction of alternative states for a broad range of proteins, yielding high-quality end states and diverse conformational ensembles. In the data set of open-closed conformations (OC23), alternate state models improved in 17 out of 23 cases without compromising the generation of the preferred state. Consistent results were observed in 16 membrane protein transporters, with improvements in 12 out of 16 targets. TM-score improvements to experimental end states were substantial, sometimes exceeding 50%, elevating mediocre scores from 0.58 to nearly perfect 0.98. Furthermore, AFsample2 increased the diversity of intermediate conformations by 70% compared to the standard AF2 system, producing highly confident models that could potentially be on-path between the two states. In addition, we also propose a way of selecting the end-states in generated model ensembles. These solutions could potentially enhance the generation and identification of alternative protein conformations, thereby providing a more comprehensive understanding of protein function and dynamics. Future work will focus on validating the accuracy of these intermediate conformations and exploring their relevance to functional transitions in proteins.
Bioinformatics
What problem does this paper attempt to address?
The main problem addressed in this paper is how to improve the AlphaFold2 (AF2) method for predicting a protein's multiple conformations and conformation ensembles in order to increase structural diversity. AlphaFold2 is a powerful tool for protein structure prediction, but by default, it tends to generate a single high-confidence model. The paper introduces a new method called AFsample2, which employs a random multiple sequence alignment (MSA) column masking strategy to reduce the impact of co-evolution signals and promote the generation of more structurally diverse models by the neural network. AFsample2 improves the ability to predict alternative states on a range of proteins, particularly in the open-closed conformation dataset (OC23), where it improves alternative state models in 17 cases without compromising preferred state prediction performance. Additionally, there are 12 out of 16 membrane protein transporter targets that show improvement. AFsample2 not only enhances the TM-score (a structural similarity metric) of the final states, but also increases the diversity of intermediate state conformations, with a 70% increase compared to the standard AF2 system. The paper also proposes a method for selecting final states from the generated ensemble of models, which could contribute to a more comprehensive understanding of protein function and dynamics. Future work will involve validating the accuracy of these intermediate conformations and exploring their relevance in protein function transitions.