How Graph Neural Network Interatomic Potentials Extrapolate: Role of the Message-Passing Algorithm

Sungwoo Kang
2024-08-13
Abstract:Graph neural network interatomic potentials (GNN-IPs) are gaining significant attention due to their capability of learning from large datasets. Specifically, universal interatomic potentials based on GNN, usually trained with crystal geometries, often exhibit remarkable extrapolative behavior towards untrained domains, such as surfaces or amorphous configurations. However, the origin of this extrapolation capability is not well understood. This work provides a theoretical explanation of how GNN-IPs extrapolate to untrained geometries. First, we demonstrate that GNN-IPs can capture non-local electrostatic interactions through the message-passing algorithm, as evidenced by tests on a toy model and DFT data. We find that GNN-IPs accurately predict electrostatic forces in untrained domains, indicating that they have learned the exact functional form of the Coulomb interaction. Based on these results, we suggest that the ability to learn non-local electrostatic interactions, coupled with the embedding nature of GNN-IPs, explains their extrapolation ability. Finally, we find that the universal GNN-IP, SevenNet-0, effectively infers non-local Coulomb interactions in untrained domains.
Materials Science
What problem does this paper attempt to address?
The paper aims to address the issue of significant extrapolation capability of Graph Neural Network Interatomic Potentials (GNN-IPs) in untrained geometric structures. Specifically, the paper explores how GNN-IPs capture non-local electrostatic interactions through message-passing algorithms and explains how this ability enables GNN-IPs to perform excellently when extrapolating to untrained configurations. The core issue of the paper is to understand how GNN-IPs can effectively learn and predict electrostatic interactions beyond the range of training data. Through theoretical analysis and experimental validation, the paper demonstrates that GNN-IPs can accurately learn the form of Coulomb interactions, and this learning ability, combined with their embedding characteristics, explains their performance in extrapolating to untrained configurations. Additionally, the paper discusses the performance differences of various models (such as SevenNet and MACE) in handling non-local interactions and provides theoretical explanations.