A Data-Driven Accelerated Sampling Method for Searching Functional States of Proteins

Qiang Zhu,Yigao Yuan,Jing Ma,Hao Dong
DOI: https://doi.org/10.1002/adts.201800171
2019-01-01
Advanced Theory and Simulations
Abstract:Protein exhibits distinct characteristics in different functional states. The lack of structural information for proteins hinders the understanding of their function. Here, a data-driven accelerated (DA2) sampling method is proposed, which is capable of searching new functional states of protein from a known structure with high efficiency. The key function of DA2 sampling is to drive the conformational change of protein along its intrinsic motion without introducing biased potential/force, where principle component analysis is applied on-the-fly to reduce the highly redundant information generated by molecular dynamics simulations. In this work, the capacity and accuracy of DA2 sampling are validated by using alanine dipeptide. This protocol is then applied to search for the dosed state of N-terminal calmodulin (nCaM) from the open one. The identified structure resembles the crystal structure of nCaM in its dosed state, with a root-mean-square deviation between the two of only 1.8 angstrom. Interestingly, independent DA2 samplings disclose different open-to-dosed transition pathways for nCaM, which is likely to have implications for its biological functions. Therefore, DA2 sampling is expected to play important roles in exploring functional states of a broad spectrum of proteins at atomic level that are not easily determined experimentally.
What problem does this paper attempt to address?