Stochastic Optimal Control for Collective Variable Free Sampling of Molecular Transition Paths

Lars Holdijk,Yuanqi Du,Ferry Hooft,Priyank Jaini,Bernd Ensing,Max Welling
2023-07-18
Abstract:We consider the problem of sampling transition paths between two given metastable states of a molecular system, e.g. a folded and unfolded protein or products and reactants of a chemical reaction. Due to the existence of high energy barriers separating the states, these transition paths are unlikely to be sampled with standard Molecular Dynamics (MD) simulation. Traditional methods to augment MD with a bias potential to increase the probability of the transition rely on a dimensionality reduction step based on Collective Variables (CVs). Unfortunately, selecting appropriate CVs requires chemical intuition and traditional methods are therefore not always applicable to larger systems. Additionally, when incorrect CVs are used, the bias potential might not be minimal and bias the system along dimensions irrelevant to the transition. Showing a formal relation between the problem of sampling molecular transition paths, the Schrödinger bridge problem and stochastic optimal control with neural network policies, we propose a machine learning method for sampling said transitions. Unlike previous non-machine learning approaches our method, named PIPS, does not depend on CVs. We show that our method successful generates low energy transitions for Alanine Dipeptide as well as the larger Polyproline and Chignolin proteins.
Biomolecules,Machine Learning,Chemical Physics
What problem does this paper attempt to address?
The paper aims to address the problem of sampling transition paths between two metastable states in molecular systems, particularly for high energy barrier issues encountered in processes such as protein folding, conformational changes, and chemical reactions. Traditional molecular dynamics (MD) simulations often struggle to effectively sample these transition paths because the energy barriers they need to cross are usually very high. To tackle this challenge, the researchers propose a method called PIPS (Path Integral Path Sampling), which is a new approach based on path integral stochastic optimal control theory. Unlike traditional methods that rely on Collective Variables (CVs), PIPS does not depend on predefined CVs but operates directly on the overall geometric structure of the molecule. The advantage of this method is that it can be applied to large systems where finding suitable CVs is difficult. Specifically, PIPS uses stochastic optimal control theory to train a parameterized bias potential to improve the efficiency of sampling transition paths from one metastable state to another. This method not only overcomes the difficulty of selecting appropriate CVs but can also be effectively applied to larger molecular systems. The paper experimentally validates the method through three different molecular systems: Alanine Dipeptide, Polyproline, and Chignolin protein. The experimental results show that the PIPS method can successfully generate low-energy transition paths without relying on CVs and has higher efficiency and accuracy compared to traditional methods. Additionally, PIPS demonstrates the ability to identify the correct collective variables for sampling even when the target state is not specified correctly. This indicates that PIPS could be used to validate the effectiveness of candidate collective variables.