ProtSCAPE: Mapping the landscape of protein conformations in molecular dynamics

Siddharth Viswanath,Dhananjay Bhaskar,David R. Johnson,Joao Felipe Rocha,Egbert Castro,Jackson D. Grady,Alex T. Grigas,Michael A. Perlmutter,Corey S. O'Hern,Smita Krishnaswamy
2024-10-27
Abstract:Understanding the dynamic nature of protein structures is essential for comprehending their biological functions. While significant progress has been made in predicting static folded structures, modeling protein motions on microsecond to millisecond scales remains challenging. To address these challenges, we introduce a novel deep learning architecture, Protein Transformer with Scattering, Attention, and Positional Embedding (ProtSCAPE), which leverages the geometric scattering transform alongside transformer-based attention mechanisms to capture protein dynamics from molecular dynamics (MD) simulations. ProtSCAPE utilizes the multi-scale nature of the geometric scattering transform to extract features from protein structures conceptualized as graphs and integrates these features with dual attention structures that focus on residues and amino acid signals, generating latent representations of protein trajectories. Furthermore, ProtSCAPE incorporates a regression head to enforce temporally coherent latent representations.
Machine Learning,Chemical Physics,Biomolecules,Quantitative Methods
What problem does this paper attempt to address?
This paper attempts to address a key issue in protein dynamics modeling, specifically how to capture the dynamic changes of proteins on the microsecond to millisecond timescale. While significant progress has been made in predicting static folded structures, challenges remain in simulating protein movements. To this end, the authors introduce a new deep learning architecture—ProteinTransformer with Scattering, Attention, and Positional Embedding (ProtSCAPE). ### Main Issues: 1. **Challenges in Protein Dynamics Modeling**: - Current methods struggle to capture the dynamic changes of proteins on the microsecond to millisecond timescale. - Most techniques rely on 1D projections, which may not fully encompass the complexity of protein dynamics, leading to incomplete or misleading interpretations. 2. **Need for Improved Methods for Protein Trajectory Analysis and Representation**: - There is a need to develop better methods to analyze and represent protein trajectories to better understand protein functions and dynamic changes. ### Solution: - **ProtSCAPE**: By combining geometric scattering transforms and Transformer-based attention mechanisms, it extracts protein dynamic features from molecular dynamics (MD) simulations. - **Multi-scale Feature Extraction**: Utilizes the multi-scale nature of geometric scattering transforms to extract features from protein structure graphs. - **Dual Attention Mechanism**: Focuses on residue and amino acid signals to generate latent representations of protein trajectories. - **Regression Head**: Used to generate temporally coherent latent representations, ensuring the model can generalize from short trajectories to long trajectories and from wild-type trajectories to mutant trajectories. ### Experimental Results: - **Interpolation and Reconstruction**: ProtSCAPE successfully interpolates intermediate structures from open to closed states and performs well in reconstructing mutant structures. - **Generalization Ability**: The model can handle not only short trajectories but also generalize to long trajectories; it can handle wild-type trajectories and generalize to mutant trajectories. ### Summary: By introducing the ProtSCAPE model, this paper aims to address key challenges in protein dynamics modeling, particularly the dynamic changes on the microsecond to millisecond timescale. By combining geometric scattering transforms and Transformer-based attention mechanisms, ProtSCAPE can capture protein dynamic changes more accurately and interpretably, providing a powerful tool for protein function research.