ANI Neural Networks Meet Electrostatics: A ML/MM Implementation in Amber

Jonathan A. Semelak,Pickering Ignacio,Kate K. Huddleston,Justo Olmos,Juan S Grassano,Camila Clemente,Salvador I. Drusin,Marcelo Marti,Mariano C. Gonzalez Lebrero,Adrian E. Roitberg,Dario A. Estrin
DOI: https://doi.org/10.26434/chemrxiv-2024-m9xgc
2024-10-03
Abstract:We present a novel integration of the ANI neural networks into the Amber software suite, offering a sophisticated machine learning/molecular mechanics (ML/MM) framework. The implementation is designed as a general-purpose tool for the simulation of neutral organic molecules, requiring no additional training for its use beyond the initial setup. The framework leverages a new ANI potential that accurately predicts geometry-dependent atomic partial charges at the Minimal Basis Iterative Stockholder (MBIS) level, enhancing the modeling of electrostatic interactions within ML/MM systems. Additionally, we incorporate a polarization correction to address the distortion effects on the ML subsystem from MM point charges. Our approach is validated through simulations of solvation profiles, vibrational spectra, and torsion free energy profiles of small molecules in aqueous environments, as well as protein-ligand interactions. Our findings demonstrate that this ML/MM framework can approximate QM/MM electrostatic embedding with significantly reduced computational demands, paving the way for more efficient and accurate simulations in computational chemistry.
Chemistry
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address the integration of ANI neural networks into the Amber software suite to provide an efficient and accurate machine learning/molecular mechanics (ML/MM) framework. Specifically, the paper focuses on the following aspects: 1. **Improving Simulation Accuracy**: By introducing new ANI potential functions, it can accurately predict geometry-dependent atomic partial charges based on the Minimal Basis Iterative Stockholder (MBIS) level, thereby enhancing the modeling of electrostatic interactions in ML/MM systems. 2. **Reducing Computational Cost**: The proposed method can approximate the electrostatic embedding effects of quantum mechanics/molecular mechanics (QM/MM) while significantly reducing computational demands, making the simulation of complex chemical and biological molecular systems more efficient. 3. **Generality**: The designed framework is a general tool applicable to the simulation of neutral organic molecules, which users can use without additional training, requiring only initial setup. 4. **Addressing Polarization Effects**: Polarization corrections are introduced to address the distortion effects from molecular mechanics point charges on the machine learning subsystem. ### Validation Methods To validate the effectiveness and accuracy of the framework, the authors conducted the following types of simulation experiments: 1. **Solvation Profiles**: Simulating the solvation profiles of small molecules in a water environment. 2. **Vibrational Spectra**: Calculating the vibrational spectra of small molecules in a water environment. 3. **Torsional Free Energy Profiles**: Analyzing the torsional free energy profiles of small molecules in a water environment. 4. **Protein-Ligand Interactions**: Evaluating the physical accuracy of protein-ligand interactions. ### Main Contributions - **New ANI-MBIS-q Model**: Predicts MBIS charges directly from the local chemical environment without the need for electron density. - **Efficient ML/MM Interface**: Implements the ANI neural network in Amber's SANDER engine, avoiding the cost of calling Python code each time. - **Wide Applicability**: The framework can be used for various systems without the need to train models from scratch. ### Conclusion Through these simulation experiments, the research results indicate that the proposed ML/MM framework can provide simulation accuracy comparable to QM/MM while significantly reducing computational demands, paving the way for efficient and accurate simulations in computational chemistry.