Score Dynamics: Scaling Molecular Dynamics with Picoseconds Time Steps via Conditional Diffusion Model

Tim Hsu,Babak Sadigh,Vasily Bulatov,Fei Zhou
DOI: https://doi.org/10.1021/acs.jctc.3c01361
2024-03-16
Journal of Chemical Theory and Computation
Abstract:We propose score dynamics (SD), a general framework for learning accelerated evolution operators with large timesteps from molecular dynamics (MD) simulations. SD is centered around scores or derivatives of the transition log-probability with respect to the dynamical degrees of freedom. The latter play the same role as force fields in MD but are used in denoising diffusion probability models to generate discrete transitions of the dynamical variables in an SD time step, which can be orders of...
chemistry, physical,physics, atomic, molecular & chemical
What problem does this paper attempt to address?
The paper aims to address the issue of time scale extension in Molecular Dynamics (MD) simulations. Specifically, it proposes a new method called "ScoreDynamics" (SD) that accelerates the time steps in MD simulations through a conditional diffusion model, achieving time step extensions from femtoseconds (fs) to picoseconds (ps). ### Main Issues 1. **Time Scale Extension**: Traditional MD simulations are limited by extremely small time steps (about 1 fs), which severely restricts the spatiotemporal scale of simulations, especially when sampling rare events. The paper attempts to extend the time step to the picosecond level (10 ps) by introducing the SD method, thereby significantly improving simulation efficiency. 2. **Rare Event Sampling**: Rare events are crucial in many scientific studies, but traditional MD methods struggle to sample these events effectively. The SD method generates discrete transitions of molecular conformations through a conditional diffusion model, enabling more efficient sampling of rare events. 3. **Model Generalization**: The paper also explores the generalization ability of the SD model, i.e., whether the model can be applied to unseen molecular systems. The results show that the SD model maintains high fidelity in both dynamics and equilibrium states even on unseen butane molecules. ### Solution - **ScoreDynamics Framework**: The core of the SD method is to use the derivatives of scores or transition log-probabilities to construct accelerated evolution operators. These scores act similarly to force fields in SD, generating discrete transitions of molecular conformations at each time step. - **Conditional Diffusion Model**: The SD method uses a conditional diffusion model to generate new conformations at each time step. By learning the mapping from Gaussian noise to real data, the model can generate reasonable molecular conformations at larger time steps. - **Graph Neural Network**: The paper employs Graph Neural Networks (GNN) to implement the SD model. GNNs can handle the complex structural information of molecular systems and effectively learn the dynamics of molecular systems during training. ### Experimental Validation - **Case Studies**: The paper validates the effectiveness of the SD method through two case studies (the behavior of alanine dipeptide and short-chain alkanes in aqueous solution). The results show that the SD method has good consistency with traditional MD methods in both equilibrium and dynamic predictions. - **Performance Improvement**: For the studied systems, the SD method is approximately two orders of magnitude faster than traditional MD methods. ### Challenges and Future Directions - **Model Training Cost**: The SD method requires a large amount of MD trajectory data for training, which is costly. - **High-Dimensional Distribution Sampling**: Even with known exact transition probabilities, sampling from high-dimensional distributions remains a challenge. - **Further Optimization**: Future research will further optimize the SD method, including improving the model architecture and reducing the need for training data. In summary, the paper proposes a new machine learning method—ScoreDynamics—to address the issues of time scale extension and rare event sampling in molecular dynamics simulations, demonstrating its potential in improving simulation efficiency and generalization ability.