Machine learning prediction of accurate atomization energies of organic molecules from low-fidelity quantum chemical calculations

Logan Ward,Ben Blaiszik,Ian Foster,Rajeev S. Assary,Badri Narayanan,Larry Curtiss
DOI: https://doi.org/10.1557/mrc.2019.107
2019-09-01
MRS Communications
Abstract:Recent studies illustrate how machine learning (ML) can be used to bypass a core challenge of molecular modeling: the trade-off between accuracy and computational cost. Here, we assess multiple ML approaches for predicting the atomization energy of organic molecules. Our resulting models learn the difference between low-fidelity, B3LYP, and high-accuracy, G4MP2, atomization energies and predict the G4MP2 atomization energy to 0.005 eV (mean absolute error) for molecules with less than nine heavy atoms (training set of 117,232 entries, test set 13,026) and 0.012 eV for a small set of 66 molecules with between 10 and 14 heavy atoms. Our two best models, which have different accuracy/speed trade-offs, enable the efficient prediction of G4MP2-level energies for large molecules and are available through a simple web interface.
materials science, multidisciplinary
What problem does this paper attempt to address?