The best DFT functional is the ensemble of functionals

Pavlo O. Dral, Yuting Rui, Yuxinxin Chen, Elena Ivanova, Ireneusz Grabowski
DOI: https://doi.org/10.26434/chemrxiv-2024-2g7zr
2024-07-19
Abstract: The development of better density functional theory (DFT) methods is one of the most active research areas, given the importance of DFT for ubiquitous molecular and materials simulations. However, this research primarily focuses on improving a specific exchange-correlation Kohn–Sham density functional. Here, we propose a robust procedure for constructing transferable ensembles of density functionals that perform better than any constituent individual density functional. We show that such ensembles built only with the density functionals predating the GMTKN55 benchmark can reach a record-low weighted error of 1.69 kcal/mol on this benchmark compared to 3.08 kcal/mol of the best constituent density functional. We also introduce the DENS24 density functional ensembles as practical DFT methods with consistently accurate performance for various simulations at affordable cost. DENS24 ensembles will be available open-source and for simulations online as described at https://xacs.xmu.edu.cn/docs/mlatom/tutorial_dens.html.
Chemistry
What problem does this paper attempt to address?
This paper explores how to improve the accuracy of molecular and materials simulations by constructing ensembles of density functional theory (DFT) functionals, called density functional ensembles (DENS). Although many DFT functionals are available, finding a single functional that performs well across different systems and properties remains a challenge. The paper proposes using ensemble learning to combine the strengths of existing functionals into a more accurate method. The ensemble is a linear model whose weights are fitted by linear regression; the weighted averaging reduces the random errors of the individual functionals while keeping the method robust and size-consistent. The researchers trained these ensembles on the GMTKN55 benchmark and introduce the DENS24 series, which outperforms every constituent functional in total ground-state energy calculations. DENS24 reduces errors on the benchmark and also transfers well to unseen data. The authors further discuss how practical the ensembles are for common simulation tasks such as geometry optimization, vibrational spectroscopy, thermochemistry, and molecular dynamics. The paper also describes the implementation of DENS24, which comprises two series based on the ORCA and PySCF programs, and its use with different basis sets. The results show that the ensembles achieve higher accuracy and a narrower error distribution than individual DFT methods in geometry optimization and other property predictions. In short, the paper addresses how ensemble learning can be used to construct more accurate and more widely applicable DFT methods, improving the predictive power of chemical and materials simulations.
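The linear-ensemble idea in the summary above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic (three hypothetical "functionals" modeled as a reference value plus independent noise), the helper names `fit_ensemble_weights`, `ensemble_energy`, and `rmse` are invented here, and the fit is an ordinary least-squares regression without an intercept. The paper's actual fitting procedure, constituent functionals, and weights may differ.

```python
import numpy as np


def fit_ensemble_weights(constituent_energies, reference_energies):
    """Fit one weight per constituent by least squares (no intercept).

    constituent_energies: array of shape (n_samples, n_functionals)
    reference_energies:   array of shape (n_samples,)
    """
    X = np.asarray(constituent_energies, dtype=float)
    y = np.asarray(reference_energies, dtype=float)
    weights, *_ = np.linalg.lstsq(X, y, rcond=None)
    return weights


def ensemble_energy(energies, weights):
    """Ensemble prediction: weighted sum of the constituent energies."""
    return float(np.dot(energies, weights))


def rmse(predicted, reference):
    """Root-mean-square error between two arrays."""
    diff = np.asarray(predicted) - np.asarray(reference)
    return float(np.sqrt(np.mean(diff ** 2)))


# Synthetic illustration only: NOT GMTKN55 data and NOT results from the paper.
rng = np.random.default_rng(0)
reference = rng.normal(scale=10.0, size=200)             # mock reference energies
noise = rng.normal(scale=[1.0, 1.5, 2.0], size=(200, 3))  # per-"functional" errors
constituents = reference[:, None] + noise                # mock constituent energies

weights = fit_ensemble_weights(constituents, reference)
rmse_each = [rmse(constituents[:, k], reference) for k in range(3)]
rmse_ensemble = rmse(constituents @ weights, reference)

print("weights:", weights)
print("RMSE of each constituent:", rmse_each)
print("RMSE of the ensemble:", rmse_ensemble)
```

Two properties of this sketch relate to the claims in the summary. On the fitting data, the least-squares ensemble cannot have a larger RMSE than any single constituent, because using one constituent alone corresponds to a particular choice of weights. And because the prediction is a fixed linear combination with no constant offset, the ensemble energy of two non-interacting fragments equals the sum of their separate ensemble energies whenever each constituent has that property. Performance on unseen data is not guaranteed by this construction and has to be checked separately.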