Performance Assessment of Universal Machine Learning Interatomic Potentials: Challenges and Directions for Materials' Surfaces

Bruno Focassio, Luis Paulo Mezzina Freitas, Gabriel R. Schleder
2024-05-31
Abstract: Machine learning interatomic potentials (MLIPs) are one of the main techniques in the materials science toolbox, able to bridge ab initio accuracy with the computational efficiency of classical force fields. This allows simulations ranging from atoms, molecules, and biosystems, to solid and bulk materials, surfaces, nanomaterials, and their interfaces and complex interactions. A recent class of advanced MLIPs, which use equivariant representations and deep graph neural networks, is known as universal models. These models are proposed as foundational models suitable for any system, covering most elements from the periodic table. Current universal MLIPs (UIPs) have been trained with the largest consistent dataset available nowadays. However, these are composed mostly of bulk materials' DFT calculations. In this article, we assess the universality of all openly available UIPs, namely MACE, CHGNet, and M3GNet, in a representative task of generalization: calculation of surface energies. We find that the out-of-the-box foundational models have significant shortcomings in this task, with errors correlated to the total energy of surface simulations, having an out-of-domain distance from the training dataset. Our results show that while UIPs are an efficient starting point for fine-tuning specialized models, we envision the potential of increasing the coverage of the materials space towards universal training datasets for MLIPs.
Materials Science, Disordered Systems and Neural Networks, Computational Physics
What problem does this paper attempt to address?
The paper evaluates how well universal machine learning interatomic potentials (UIPs) perform on a representative generalization task, the calculation of surface energies, and considers their potential as base models for fine-tuning to specific tasks. The authors assess three openly available UIPs: MACE, CHGNet, and M3GNet. They find that the out-of-the-box models have significant shortcomings in predicting surface energies, particularly for surface structures that lie outside the domain of the training data, which consists mostly of DFT calculations on bulk materials. Although the models predict the total energy of bulk materials well, they show large errors in surface energy, and these errors are correlated with the total energy of the surface simulations. The paper also compares the UIPs with specially trained interatomic potentials (such as MTP and NequIP) and finds that the specialized models predict the surface energies of certain elements more accurately. The authors conclude that current UIPs are an efficient starting point for fine-tuning specialized models, and that their accuracy on surfaces could be improved by extending the training datasets to cover more of the materials space beyond bulk structures.
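To make the benchmark quantity concrete, the sketch below computes a surface energy from total energies using the standard symmetric-slab definition, gamma = (E_slab - N * E_bulk_per_atom) / (2 * A). This formula is the common textbook convention and is an assumption here; the summary above does not state the exact expression used in the paper. All function names and numerical inputs are hypothetical and chosen only for illustration.

```python
# Minimal sketch of a slab-based surface energy calculation.
# Assumption: symmetric slab with two equivalent surfaces of area A each.
# All names and numbers are illustrative, not taken from the paper.

EV_PER_A2_TO_J_PER_M2 = 16.02176634  # 1 eV/A^2 expressed in J/m^2


def surface_energy(e_slab, n_atoms, e_bulk_per_atom, area):
    """Return the surface energy in eV/A^2.

    e_slab          -- total energy of the slab (eV)
    n_atoms         -- number of atoms in the slab
    e_bulk_per_atom -- bulk reference energy per atom (eV)
    area            -- area of one slab face (A^2)
    """
    if area <= 0:
        raise ValueError("area must be positive")
    # Excess energy of the slab relative to the same number of bulk atoms,
    # shared between the two faces of the slab.
    return (e_slab - n_atoms * e_bulk_per_atom) / (2.0 * area)


def surface_energy_error(per_atom_error, n_atoms, area):
    """Surface energy error (eV/A^2) caused by a uniform per-atom error
    in the slab total energy, assuming an exact bulk reference."""
    return n_atoms * per_atom_error / (2.0 * area)


if __name__ == "__main__":
    gamma = surface_energy(e_slab=-35.0, n_atoms=10,
                           e_bulk_per_atom=-3.6, area=8.0)
    print(f"gamma = {gamma:.4f} eV/A^2 "
          f"= {gamma * EV_PER_A2_TO_J_PER_M2:.3f} J/m^2")
    err = surface_energy_error(per_atom_error=0.01, n_atoms=10, area=8.0)
    print(f"error = {err * EV_PER_A2_TO_J_PER_M2:.3f} J/m^2")
```

Because the surface energy is a small difference between two large total energies, a modest error in the slab total energy can be a sizeable fraction of the result. In this hypothetical example, a 10 meV/atom error on a 10-atom slab shifts a surface energy of about 1.0 J/m^2 by about 0.1 J/m^2.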