An Evaluation of Force Field Accuracy for the Mini-Protein Chignolin using Markov State Models

Vincent Voelz, Tim Marshall, Robert Raddi
DOI: https://doi.org/10.26434/chemrxiv-2024-xmztm
2024-03-04
Abstract: All-atom molecular dynamics (MD) simulations can provide detailed insight into a molecule's conformational ensemble in solution. While molecular force fields are parameterized to accurately model a protein's potential energy surface, it remains challenging in practice to evaluate how well force fields capture ensemble-averaged experimental observables, since doing so requires simulating the complete folding landscape. In this work, we employ massively parallel molecular simulations, performed using the Folding@home distributed computing platform, to investigate the ability of nine force fields (AMBER14SB, AMBER99, AMBER99SB, AMBER99SB-ildn, AMBER99SBnmr1-ildn, CHARMM22*, CHARMM27, CHARMM36 and OPLS-aa) with TIP3P explicit solvent to accurately reproduce experimental observables for chignolin, a beta-hairpin mini-protein with an experimental folding time of ~600 ns. From over 200 µs of aggregate trajectory data, we constructed Markov state models (MSMs) to obtain estimates of thermodynamic and kinetic properties of chignolin in each force field. The force fields were assessed quantitatively by comparing predicted and experimental folded populations, and by the statistical agreement between predicted and experimental solution-state NMR observables. This work highlights the utility of MSM approaches for force field evaluation, and provides a baseline for future studies using Bayesian inference methods to evaluate and parameterize force fields.
Chemistry
What problem does this paper attempt to address?
The paper evaluates how accurately nine force fields (AMBER14SB, AMBER99, AMBER99SB, AMBER99SB-ildn, AMBER99SBnmr1-ildn, CHARMM22*, CHARMM27, CHARMM36, and OPLS-aa), each with TIP3P explicit solvent, simulate the β-hairpin mini-protein chignolin. The researchers ran massively parallel molecular dynamics simulations on the Folding@home distributed computing platform, generating over 200 µs of aggregate trajectory data, and constructed Markov state models (MSMs) to estimate the thermodynamic and kinetic properties of chignolin in each force field. Each force field was then assessed quantitatively in two ways: by comparing predicted and experimental folded-state populations, and by measuring the statistical agreement between predicted and experimental solution-state NMR observables. The work highlights the utility of MSMs for force field evaluation and provides a baseline for future studies that use Bayesian inference methods to evaluate and parameterize force fields.
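To make concrete how an MSM yields a folded population, here is a minimal sketch in Python with NumPy. It is not the authors' pipeline: the function names, the toy two-state trajectory, and the choice of lag time are all hypothetical, and the transition matrix is estimated by simply symmetrizing the transition counts, which is a cruder stand-in for the reversible estimators used in dedicated MSM software. The idea it illustrates is general, though: count transitions between discretized conformational states at a fixed lag, row-normalize to get transition probabilities, and read equilibrium populations from the stationary distribution.

```python
import numpy as np

def count_matrix(dtrajs, n_states, lag):
    """Count transitions i -> j separated by `lag` frames in each discrete trajectory."""
    counts = np.zeros((n_states, n_states))
    for dtraj in dtrajs:
        dtraj = np.asarray(dtraj)
        for i, j in zip(dtraj[:-lag], dtraj[lag:]):
            counts[i, j] += 1
    return counts

def transition_matrix(counts):
    """Row-normalize symmetrized counts (assumption: a simple way to enforce
    detailed balance, not a maximum-likelihood reversible estimator)."""
    sym = counts + counts.T
    return sym / sym.sum(axis=1, keepdims=True)

def stationary_distribution(T):
    """Equilibrium populations: the left eigenvector of T with eigenvalue 1."""
    evals, evecs = np.linalg.eig(T.T)
    idx = np.argmax(evals.real)
    pi = np.abs(evecs[:, idx].real)
    return pi / pi.sum()

def folded_population(pi, folded_states):
    """Sum the equilibrium populations of the states labeled as folded."""
    return float(pi[list(folded_states)].sum())

# Hypothetical toy data: state 0 = unfolded, state 1 = folded.
dtrajs = [[0, 0, 0, 1, 1, 0, 0, 1, 1, 1]]
T = transition_matrix(count_matrix(dtrajs, n_states=2, lag=1))
pi = stationary_distribution(T)
print("transition matrix:\n", T)
print("folded population:", folded_population(pi, folded_states=[1]))
```

In a real analysis the states would come from clustering many trajectories in a suitable feature space, the lag time would be chosen by checking that implied timescales have converged, and the resulting folded population would be the quantity compared against experiment.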