Abstract: Molecular dynamics (MD) simulations are ideally suited to describe conformational ensembles of biomolecules such as proteins and nucleic acids. Microsecond-long simulations are now routine, facilitated by the emergence of graphics processing units. Processing such ensembles on the basis of statistical mechanics can bring insights about the different biologically relevant states, their representative structures, and even the dynamics between states. Clustering, which groups objects based on structural similarity, is typically used to process ensembles, yielding the different states, their populations, and representative structures. For some purposes, such as protein structure prediction, we are interested in identifying the representative structure that is most similar to the native state of the protein. The traditional pipeline combines hierarchical clustering with selection of the cluster centroid as the representative of each cluster. However, even when the first cluster represents the native basin, the centroid can be several angstroms away in RMSD from the native state, and many other structures inside this cluster could be better choices of representative structure, reducing the need for protein structure refinement. In this study, we developed a module, Protein Retrieval via Integrative Molecular Ensembles (PRIME), that consists of tools to determine the most prevalent states in an ensemble using extended continuous similarity. PRIME is integrated with our Molecular Dynamics Analysis with N-ary Clustering Ensembles (MDANCE) package and can be used as a post-processing tool for arbitrary clustering algorithms, compatible with several MD suites. PRIME was validated on ensembles of different protein and protein-complex systems for its ability to reliably identify the most native-like state, which we compare to the experimental structure and to the result of the traditional approach.
Systems were chosen to represent different degrees of difficulty, such as folding processes and binding events that require large conformational changes. PRIME predictions produced structures that superposed better on the experimental structure (lower RMSD). A further benefit of PRIME is its linear scaling, rather than the O(N²) scaling traditionally associated with comparisons of elements in a set.
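To illustrate why a set-level similarity measure can scale linearly, the sketch below computes the mean squared deviation (MSD) of a set of frames from running sums instead of from all pairs, and then picks a representative frame by removing each frame in turn. This is a minimal illustration under stated assumptions, not PRIME's implementation: MSD stands in for the extended continuous similarity indices, frames are assumed to be pre-aligned and flattened coordinate lists, and all function names are hypothetical.

```python
import random


def running_sums(frames):
    """Return (n, per-coordinate sums, total of squared coordinates) in one pass."""
    dim = len(frames[0])
    coord_sum = [0.0] * dim
    sq_sum = 0.0
    for frame in frames:
        for k, value in enumerate(frame):
            coord_sum[k] += value
            sq_sum += value * value
    return len(frames), coord_sum, sq_sum


def msd_from_sums(n, coord_sum, sq_sum):
    """Mean squared deviation over all ordered pairs, from the sums alone.

    Uses (1/n^2) * sum_ij |x_i - x_j|^2 = 2 * (mean(|x|^2) - |mean(x)|^2).
    """
    mean_sq = sq_sum / n
    sq_of_mean = sum((s / n) ** 2 for s in coord_sum)
    return 2.0 * (mean_sq - sq_of_mean)


def msd_pairwise(frames):
    """Reference O(N^2) version that visits every ordered pair explicitly."""
    n = len(frames)
    total = 0.0
    for a in frames:
        for b in frames:
            total += sum((p - q) ** 2 for p, q in zip(a, b))
    return total / (n * n)


def most_central_frame(frames):
    """Index of the frame whose removal leaves the largest MSD (medoid-like).

    Removing a central frame raises the spread of the remainder, while
    removing an outlier lowers it, so the maximum marks the most central frame.
    """
    n, coord_sum, sq_sum = running_sums(frames)
    best_index, best_score = -1, float("-inf")
    for i, frame in enumerate(frames):
        # Subtract this frame's contribution instead of re-scanning the set.
        rest_sum = [s - v for s, v in zip(coord_sum, frame)]
        rest_sq = sq_sum - sum(v * v for v in frame)
        score = msd_from_sums(n - 1, rest_sum, rest_sq)
        if score > best_score:
            best_index, best_score = i, score
    return best_index


# Synthetic stand-in for an aligned cluster: 50 frames, 6 coordinates each.
random.seed(0)
frames = [[random.gauss(0.0, 1.0) for _ in range(6)] for _ in range(50)]
print(msd_from_sums(*running_sums(frames)), msd_pairwise(frames))
print(most_central_frame(frames))
```

Both MSD routines return the same value up to rounding, but the sum-based one touches each frame once. The representative search also stays linear in the number of frames, because each leave-one-out score is obtained by subtracting a single frame's contribution from the precomputed sums.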
