Abstract:We develop a new deep potential—range correction (DPRc) machine learning potential for combined quantum mechanical/molecular mechanical (QM/MM) simulations of chemical reactions in the condensed phase. The new range correction enables short-ranged QM/MM interactions to be tuned for higher accuracy, and the correction smoothly vanishes within a specified cutoff. We further develop an active learning procedure for robust neural network training. We test the DPRc model and training procedure against a series of six nonenzymatic phosphoryl transfer reactions in solution that are important in mechanistic studies of RNA-cleaving enzymes. Specifically, we apply DPRc corrections to a base QM model and test its ability to reproduce free-energy profiles generated from a target QM model. We perform these comparisons using the MNDO/d and DFTB2 semiempirical models because they differ in the way they treat orbital orthogonalization and electrostatics and produce free-energy profiles which differ significantly from each other, thereby providing us a rigorous stress test for the DPRc model and training procedure. The comparisons show that accurate reproduction of the free-energy profiles requires correction of the QM/MM interactions out to 6 Å. We further find that the model's initial training benefits from generating data from temperature replica exchange simulations and including high-temperature configurations into the fitting procedure, so the resulting models are trained to properly avoid high-energy regions. A single DPRc model was trained to reproduce four different reactions and yielded good agreement with the free-energy profiles made from the target QM/MM simulations. The DPRc model was further demonstrated to be transferable to 2D free-energy surfaces and 1D free-energy profiles that were not explicitly considered in the training. Examination of the computational performance of the DPRc model showed that it was fairly slow when run on CPUs but was sped up almost 100-fold when using NVIDIA V100 GPUs, resulting in almost negligible overhead. The new DPRc model and training procedure provide a potentially powerful new tool for the creation of next-generation QM/MM potentials for a wide spectrum of free-energy applications ranging from drug discovery to enzyme design.The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.1c00201.Training errors (energy unit: eV, force unit: eV/Å) for the native reaction with different ML parameter sets; ratio of frames with different model deviations in the final simulations for the native reaction with different ML parameter sets; comparison of target (DFTB2) and model (MNDO/d+ML) transition-state coordinate values; transition-state (TS) barrier height and reaction coordinate values for the 2D study of the native reaction, with DFTB2 and MNDO/d Hamiltonians and ML models trained only on 1D reaction; training errors and histogram of frame energies; forces in the QM region; forces in the MM region; free-energy profiles of different variants, compared with original DFTB2 and MNDO/d curves; and illustration of scaling of the wall clock time per simulation step (ms/step) observed in the DFTB2 QM/MM simulations of the native reaction with and without the use of DPRc corrections CPU and GPU timings plotted separately(PDF)This article has not yet been cited by other publications.

chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics

Training Transferable Interatomic Neural Network Potentials for Reactive Chemistry: Improved Chemical Space Sampling

Refining Potential Energy Surface through Dynamical Properties via Differentiable Molecular Simulation

Force Training Neural Network Potential Energy Surface Models

Scalable Training of Neural Network Potentials for Complex Interfaces Through Data Augmentation

Molecular Dynamics with Neural-Network Potentials

Ab initio Accuracy Neural Network Potential for Drug-like Molecules

Deepks: A Comprehensive Data-Driven Approach Toward Chemically Accurate Density Functional Theory

Introduction to machine learning potentials for atomistic simulations

Neural Network Potential with Multi-Resolution Approach Enables Accurate Prediction of Reaction Free Energies in Solution

The Potential of Neural Network Potentials

Extending the atomic decomposition and many-body representation, a chemistry-motivated monomer-centered approach for machine learning potentials

Transfer learning for chemically accurate interatomic neural network potentials

Strategies for the Construction of Machine-Learning Potentials for Accurate and Efficient Atomic-Scale Simulations

Deep Potential Molecular Dynamics: A Scalable Model with the Accuracy of Quantum Mechanics

Implicit Delta Learning of High Fidelity Neural Network Potentials

Improving the reliability of machine learned potentials for modeling inhomogenous liquids

Development of Range-Corrected Deep Learning Potentials for Fast, Accurate Quantum Mechanical/Molecular Mechanical Simulations of Chemical Reactions in Solution

De novo exploration and self-guided learning of potential-energy surfaces

Efficient Training of Neural Network Potentials for Chemical and Enzymatic Reactions by Continual Learning