Unsupervised learning of progress coordinates during weighted ensemble simulations: Application to millisecond protein folding

Jeremy Leung,Nicolas Frazee,Alex Brace,Arvind Ramanathan,Lillian Chong
DOI: https://doi.org/10.1101/2024.08.28.610178
2024-08-30
bioRxiv
Abstract:Our method identifies outliers in a latent space model of the system's sampled conformations that is periodically trained using a convolutional variational autoencoder. As a proof of principle, we applied our DL-enhanced WE method to simulate a millisecond protein folding process. To enable rapid tests, our simulations propagated discrete-state synthetic molecular dynamics trajectories using a generative, fine-grained Markov state model. Results revealed that our ″on-the-fly″ DL of outliers enhanced the efficiency of WE by >3-fold in estimating the folding rate constant. Our efforts are a significant step forward in the unsupervised learning of slow coordinates during rare event sampling.
What problem does this paper attempt to address?