Improving Molecular Modeling with Geometric GNNs: an Empirical Study

Ali Ramlaoui,Théo Saulus,Basile Terver,Victor Schmidt,David Rolnick,Fragkiskos D. Malliaros,Alexandre Duval
2024-07-11
Abstract:Rapid advancements in machine learning (ML) are transforming materials science by significantly speeding up material property calculations. However, the proliferation of ML approaches has made it challenging for scientists to keep up with the most promising techniques. This paper presents an empirical study on Geometric Graph Neural Networks for 3D atomic systems, focusing on the impact of different (1) canonicalization methods, (2) graph creation strategies, and (3) auxiliary tasks, on performance, scalability and symmetry enforcement. Our findings and insights aim to guide researchers in selecting optimal modeling components for molecular modeling tasks.
Machine Learning
What problem does this paper attempt to address?
The paper primarily explores the application of Geometric Graph Neural Networks (GNNs) in molecular modeling and evaluates the impact of different methods on model performance, scalability, and symmetry execution through empirical studies. Specifically, the paper attempts to address the following aspects: 1. **Comparison of Different Normalization Methods**: The researchers evaluated recently proposed normalization methods in ensuring or approximating Euclidean symmetry for handling 3D atomic systems. The experimental results indicate that approximate normalization methods (such as Stochastic Frame Averaging, SFA) can provide performance comparable to or even better than exact symmetry methods in certain practical application scenarios. 2. **Impact of Graph Construction Strategies**: The authors studied the impact of the graph construction process (i.e., how to create graphs based on the distances between atoms) and rewiring strategies on model performance. The results show that selecting an appropriate distance threshold is crucial when constructing geometric graphs, as too small or too large a threshold can affect model performance. Additionally, physics-inspired modules (such as Ewald-based methods) are effective for simple models like SchNet but do not significantly improve more complex models like FAENet. 3. **Role of Auxiliary Tasks**: The paper also explores how to leverage other tasks to improve FAENet's performance on specific tasks (such as IS2RE). Introducing Noisy Nodes as an auxiliary task can alleviate the over-smoothing problem and allow for the use of more interaction layers, thereby enhancing model performance. Furthermore, pre-trained models can improve performance across different tasks, demonstrating the knowledge transfer capability between atomic property prediction tasks. In summary, this paper aims to provide guidance to researchers, helping them choose the best modeling components based on specific task requirements to optimize the performance of geometric GNNs in molecular modeling tasks.