Molecular free energies, rates, and mechanisms from data-efficient path sampling simulations

Gianmarco Lazzeri,Hendrik Jung,Peter G. Bolhuis,Roberto Covino
2023-07-28
Abstract:Molecular dynamics is a powerful tool for studying the thermodynamics and kinetics of complex molecular events. However, these simulations can rarely sample the required time scales in practice. Transition path sampling overcomes this limitation by collecting unbiased trajectories capturing the relevant events. Moreover, the integration of machine learning can boost the sampling while simultaneously learning a quantitative representation of the mechanism. Still, the resulting trajectories are by construction non-Boltzmann-distributed, preventing the calculation of free energies and rates. We developed an algorithm to approximate the equilibrium path ensemble from machine learning-guided path sampling data. At the same time, our algorithm provides efficient sampling, the mechanism, free energy, and rates of rare molecular events at a very moderate computational cost. We tested the method on the folding of the mini-protein chignolin. Our algorithm is straightforward and data-efficient, opening the door to applications on many challenging molecular systems.
Chemical Physics,Statistical Mechanics,Computational Physics,Biomolecules
What problem does this paper attempt to address?
The paper aims to address the issue of rare event transitions in molecular dynamics simulations. Specifically, the authors propose a new computational scheme that can simultaneously obtain the dynamical mechanisms, thermodynamic properties, and kinetic rates of rare event transitions with fewer computational resources. The main problems this paper attempts to solve are as follows: 1. **Long Timescale Problem**: Molecular dynamics (MD) simulations in practical applications find it difficult to cover sufficiently long timescales, making it impossible to capture the occurrence of rare events. 2. **Non-Equilibrium Distribution Problem**: Traditional transition path sampling (TPS) methods, although they can improve the sampling efficiency of rare events, generate trajectories that are non-Boltzmann distributed, making it impossible to directly calculate free energy and reaction rates. 3. **Efficient Sampling and Mechanism Discovery**: By combining deep learning with TPS, the path sampling is automatically enhanced, and the transition mechanisms are autonomously learned, thereby improving sampling efficiency. 4. **Free Energy and Reaction Rate Calculation**: An algorithm is proposed that can estimate the free energy profile and reaction rates from a small number of TPS simulations, while extending to the entire configuration space. Through this method, researchers can obtain complete free energy profiles and reaction rate information of complex molecular systems at relatively low computational costs. The paper demonstrates the effectiveness of this method in several benchmark systems, including a 2D system with high energy barriers and the folding process of the mini-protein chignolin.