Assessment of embedding schemes in a hybrid machine learning/classical potentials (ML/MM) scheme

Juan Santiago Grassano,Jonathan Alexis Semelak,Ignacio Javier Pickering,Mariano Camilo Gonzalez Lebrero,Roitberg Adrian,Estrin Dario Ariel
DOI: https://doi.org/10.26434/chemrxiv-2023-9r018
2023-11-22
Abstract:Machine Learning (ML) methods have reached high accuracy levels for the prediction of in vacuo molecular properties. However, the simulation of large systems through solely ML methods (like those based on neural network potentials) is still a challenge. In this context, one of the most promising frameworks for integrating ML schemes in the simulation of complex molecular systems are the so-called ML/MM methods. These multiscale approaches combine ML methods with classical forcefields (MM), in the same spirit as the succesful hybrid quantum mechanics-molecular mechanics methods (QM/MM). The key issue for such ML/MM methods is the adequate description of the coupling between the region of the system described by ML and the region described at the MM level. In the context of QM/MM schemes, the main ingredient of the interaction is electrostatic, and the state of the art is the so called electrostatic-embedding. In this study, we analyze the quality of simpler mechanical embedding-based approaches, specifically focusing on their application within a ML/MM framework utilizing atomic partial charges derived in vacuo. Taking as reference electrostatic embedding calculations performed at a QM(DFT)/MM level, we explore different atomic charges schemes, as well as a polarization correction computed using atomic polarizabilites. Our benchmark data set comprises a set of about 80k small organic structures from the ANI-1x database, solvated in water. The results suggest that the MBIS atomic charges yield the best agreement with the reference coupling energy. Remarkable enhancements are achieved by including a simple polarization correction.
Chemistry
What problem does this paper attempt to address?
This paper discusses how to effectively combine machine learning (ML) methods with classical force fields (MM) in molecular simulations to solve simulation problems of large and complex systems. Specifically, the focus of the research is on evaluating the performance of different embedding schemes in hybrid ML/MM frameworks, particularly Mechanical Embedding (ME) methods and Polarizable Mechanical Embedding (ME+P) methods. The paper compares and analyzes various atomic partial charge schemes and introduces polarization corrections to improve the accuracy of mechanical embedding. In traditional quantum mechanics/molecular mechanics (QM/MM) methods, electrostatic embedding is the most commonly used coupling method, which uses QM electronic density to calculate interactions between QM and MM regions. However, mechanical embedding does not consider the influence of MM systems on QM energy and forces; instead, it preassigns atomic charges to estimate coupling energies. In order to approach the quality of electrostatic embedding, some methods attempt to include MM systems within the ML framework or use atomic polarizability to correct energies. In this paper, the researchers used a database of around 80,000 small organic structures dissolved in water as a benchmark test set, with DFT electrostatic embedding calculations as a reference, to study the effects of different atomic charge schemes and polarization corrections. The results demonstrated that the MBIS atomic charge scheme was most consistent with the reference coupling energy, and the accuracy of the results could be significantly improved by a simple polarization correction. In conclusion, the paper aims to improve ML/MM methods, particularly by optimizing mechanical embedding schemes and introducing polarization effects, to more accurately simulate large and complex molecular systems, such as solvent effects and molecular interactions in enzymatic catalysis and other biochemical processes.