Abstract: Molecular dynamics (MD) is a powerful and widely used approach to understanding chemical processes in proteins in atomic detail. The accuracy of MD results depends strongly on the force field. Molecular mechanical (MM) force fields currently dominate MD simulations because of their low computational cost. Quantum mechanical (QM) calculations are more accurate but far too time-consuming for protein simulations. Machine learning (ML) can generate potentials of QM-level accuracy at little additional computational cost for specific systems that can be studied at the QM level. However, constructing general machine-learned force fields, which are needed for broad applications and for large, complex systems, remains challenging. Here, general and transferable neural network (NN) force fields based on the CHARMM force fields, named CHARMM-NN, are constructed for proteins by training NN models on 27 fragments generated by the residue-based systematic molecular fragmentation (rSMF) method. The NN for each fragment is based on atom types and uses new input features that resemble MM inputs, including bonds, angles, dihedrals, and non-bonded terms. These features enhance the compatibility of CHARMM-NN with MM MD and enable the implementation of CHARMM-NN force fields in different MD programs. While the main part of the protein energy comes from rSMF and the NNs, the non-bonded interactions between fragments and with water are taken from the CHARMM force field through mechanical embedding. Validation on dipeptides against geometric data, relative potential energies, and structural reorganization energies demonstrates that the CHARMM-NN local minima on the potential energy surface closely approximate the QM ones, showing the success of CHARMM-NN for bonded interactions. However, MD simulations of peptides and proteins indicate that future improvements of CHARMM-NN should consider more accurate representations of protein-water interactions within fragments and of non-bonded interactions between fragments, which could raise the accuracy beyond the current mechanical-embedding QM/MM level.
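
The MM-like input features named in the abstract (bonds, angles, dihedrals, and non-bonded terms) can be illustrated with a minimal sketch. This is not the CHARMM-NN implementation: the function names, the index-tuple format, and the use of a plain pair distance as the non-bonded feature are assumptions made here for illustration only, and the actual feature definitions may differ.

```python
import math

# Small 3-vector helpers on plain tuples (standard library only).
def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _norm(a):
    return math.sqrt(_dot(a, a))

def bond_length(xyz, i, j):
    """Distance between atoms i and j."""
    return _norm(_sub(xyz[j], xyz[i]))

def bond_angle(xyz, i, j, k):
    """Angle i-j-k in radians, with j as the central atom."""
    u = _sub(xyz[i], xyz[j])
    v = _sub(xyz[k], xyz[j])
    cos_t = _dot(u, v) / (_norm(u) * _norm(v))
    # Clamp to guard against round-off slightly outside [-1, 1].
    return math.acos(max(-1.0, min(1.0, cos_t)))

def dihedral(xyz, i, j, k, l):
    """Signed dihedral i-j-k-l in radians, in the range (-pi, pi]."""
    b1 = _sub(xyz[j], xyz[i])
    b2 = _sub(xyz[k], xyz[j])
    b3 = _sub(xyz[l], xyz[k])
    n1 = _cross(b1, b2)  # normal of the i-j-k plane
    n2 = _cross(b2, b3)  # normal of the j-k-l plane
    y = _norm(b2) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.atan2(y, x)

def featurize(xyz, bonds, angles, dihedrals, pairs):
    """Collect internal-coordinate features for one fragment.

    `pairs` lists non-bonded atom pairs; using the raw pair distance
    as the non-bonded feature is an assumption of this sketch.
    """
    return {
        "bonds": [bond_length(xyz, *b) for b in bonds],
        "angles": [bond_angle(xyz, *a) for a in angles],
        "dihedrals": [dihedral(xyz, *d) for d in dihedrals],
        "nonbonded": [bond_length(xyz, *p) for p in pairs],
    }

# Hypothetical four-atom chain used only to exercise the functions.
xyz_demo = [(1.0, 0.0, 0.0), (0.0, 0.0, 0.0), (0.0, 0.0, 1.0), (0.0, 1.0, 1.0)]
features = featurize(
    xyz_demo,
    bonds=[(0, 1), (1, 2), (2, 3)],
    angles=[(0, 1, 2), (1, 2, 3)],
    dihedrals=[(0, 1, 2, 3)],
    pairs=[(0, 3)],
)
```

Because such features depend only on internal coordinates, they are unchanged by rigid translation or rotation of the fragment, and they map directly onto the bonded and non-bonded term lists that MM programs already maintain.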

Top-down machine learning of coarse-grained protein force-fields

Navigating protein landscapes with a machine-learned transferable coarse-grained model

Learning Effective Molecular Models from Experimental Observables

Machine learning coarse-grained potentials of protein thermodynamics

A unified framework for coarse grained molecular dynamics of proteins with high-fidelity reconstruction

Machine Learning of Coarse-Grained Molecular Dynamics Force Fields

Consistent View of Protein Fluctuations from All-Atom Molecular Dynamics and Coarse-Grained Dynamics with Knowledge-Based Force-Field

Machine learned coarse-grained protein force-fields: Are we there yet?

Two for One: Diffusion Models and Force Fields for Coarse-Grained Molecular Dynamics

Differentiable molecular simulation can learn all the parameters in a coarse-grained force field for proteins

Monte Carlo Sampling of Protein Folding by Combining an All-Atom Physics-Based Model with a Native State Bias

Toward a general neural network force field for protein simulations: Refining the intramolecular interaction in protein

Transferable Coarse Graining Via Contrastive Learning of Graph Neural Networks

Transferable coarse-grained potential for de novo protein folding and design

Simulate Time-integrated Coarse-grained Molecular Dynamics with Multi-Scale Graph Networks

Coarse Graining Molecular Dynamics with Graph Neural Networks

Machine learning for protein folding and dynamics

Protein Structure Prediction Using Coarse Grain Force Fields

Coarse-Grained Molecular Dynamics Study Based on TorchMD

Direct generation of protein conformational ensembles via machine learning

Learning effective amino acid interactions through iterative stochastic techniques