QSPR models for solvation enthalpy based on quantum chemical descriptors
Xinliang Yu, Hanlu Wang, William E. Acree Jr., Jiyong Deng
DOI: https://doi.org/10.1016/j.molliq.2023.122884
IF: 6
2023-08-29
Journal of Molecular Liquids
Abstract: Quantum chemical descriptors were calculated with the GEDIIS/GDIIS optimizer in Gaussian 09, and five of them, related to molecular polarizability, atomic charges, and the charge product between solvent and solute, were selected as the optimal descriptor subset for developing quantitative structure–property relationship (QSPR) models of 7215 enthalpies of solvation and vaporization. The random forest (RF) algorithm was used to develop RF Model I, for which the division of the dataset into training and test sets was based mainly on solvent type. For RF Model I, the training set comprises n = 3633 enthalpies of solvation and vaporization, with a coefficient of determination R² = 0.986 and a root mean square (rms) error of 2.598 kJ/mol; the test set has n = 3582, R² = 0.933, and rms = 5.501 kJ/mol. RF Model VI, which uses the Kennard-Stone algorithm for dataset selection, gives n = 4810, R² = 0.987, rms = 2.550 kJ/mol for the training set and n = 2405, R² = 0.940, rms = 4.659 kJ/mol for the test set. These statistics compare favorably with those of other QSPR models for enthalpies of solvation reported in the literature. Furthermore, because RF Models I and VI are based on large data sets, they can be used to predict both solvation enthalpies and vaporization enthalpies.
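The Kennard-Stone selection and the two reported statistics can be illustrated with a minimal, standard-library Python sketch. It is not the authors' code: the function names (`kennard_stone`, `rmse`, `r_squared`), the Euclidean distance metric, and the definition of R² as 1 − SS_res/SS_tot are all assumptions made here for illustration, since the abstract does not state them.

```python
import math


def kennard_stone(X, n_train):
    """Split row indices of X into (train, test) by Kennard-Stone selection.

    Assumption: Euclidean distance in descriptor space; the abstract does
    not state which metric was used.
    """
    n = len(X)
    if not 2 <= n_train <= n:
        raise ValueError("n_train must be between 2 and len(X)")
    # Full pairwise distance matrix (O(n^2) memory; fine for a sketch).
    dist = [[math.dist(a, b) for b in X] for a in X]
    # Seed the training set with the two most distant samples.
    i0, j0 = max(
        ((i, j) for i in range(n) for j in range(i + 1, n)),
        key=lambda p: dist[p[0]][p[1]],
    )
    selected = [i0, j0]
    remaining = set(range(n)) - {i0, j0}
    # Distance from each unselected sample to its nearest selected sample.
    min_d = {k: min(dist[k][i0], dist[k][j0]) for k in remaining}
    while len(selected) < n_train:
        # Add the sample that is farthest from everything selected so far.
        nxt = max(sorted(remaining), key=lambda k: min_d[k])
        selected.append(nxt)
        remaining.remove(nxt)
        for k in remaining:
            min_d[k] = min(min_d[k], dist[k][nxt])
    return selected, sorted(remaining)


def rmse(y_true, y_pred):
    """Root mean square error between observed and predicted values."""
    return math.sqrt(
        sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    )


def r_squared(y_true, y_pred):
    """Coefficient of determination, assumed here to be 1 - SS_res/SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot
```

With the sizes quoted for RF Model VI, `n_train` would be 4810 of the 7215 samples, a 2:1 training-to-test ratio. The full distance matrix used above is practical only for small illustrations; a data set of that size would call for an array-based implementation.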
chemistry, physical; physics, atomic, molecular & chemical