Structure-based out-of-distribution (OOD) materials property prediction: a benchmark study

Sadman Sadeed Omee, Nihang Fu, Rongzhi Dong, Ming Hu, Jianjun Hu
2024-01-16
Abstract: In real-world material research, machine learning (ML) models are usually expected to predict and discover novel exceptional materials that deviate from the known materials. It is thus a pressing question to provide an objective evaluation of ML model performances in property prediction of out-of-distribution (OOD) materials that are different from the training set distribution. Traditional performance evaluation of materials property prediction models through random splitting of the dataset frequently results in artificially high performance assessments due to the inherent redundancy of typical material datasets. Here we present a comprehensive benchmark study of structure-based graph neural networks (GNNs) for extrapolative OOD materials property prediction. We formulate five different categories of OOD ML problems for three benchmark datasets from the MatBench study. Our extensive experiments show that current state-of-the-art GNN algorithms significantly underperform for the OOD property prediction tasks on average compared to their baselines in the MatBench study, demonstrating a crucial generalization gap in realistic material prediction tasks. We further examine the latent physical spaces of these GNN models and identify the sources of CGCNN, ALIGNN, and DeeperGATGNN's significantly more robust OOD performance than those of the current best models in the MatBench study (coGN and coNGN), and provide insights to improve their performance.
Materials Science, Machine Learning
What problem does this paper attempt to address?
The paper asks how well machine learning (ML) models in materials science predict the properties of out-of-distribution (OOD) materials, that is, materials that differ from the training set distribution. Real-world materials research often requires exactly this kind of prediction, since the goal is usually to discover novel, exceptional materials rather than ones that resemble known examples. The paper argues that existing evaluations are too optimistic: models are typically scored on random splits of a dataset, and because typical materials datasets contain many redundant, highly similar samples, random splitting inflates the measured performance.

To address this, the authors present a benchmark study of structure-based graph neural networks (GNNs) for OOD materials property prediction. They formulate five categories of OOD ML problems and run experiments on three datasets from the MatBench benchmark. On average, current state-of-the-art GNN algorithms perform significantly worse on these OOD tasks than their baselines in the MatBench study, which reveals a generalization gap in realistic materials prediction tasks.

The authors also examine the latent physical spaces of the GNN models. They identify why CGCNN, ALIGNN, and DeeperGATGNN are significantly more robust on OOD tasks than the current best models in MatBench (coGN and coNGN), and they offer insights for improving performance. Comparisons across the different OOD test sets show that OOD prediction needs further methodological improvement, for example through domain adaptation.
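The contrast between random-split and OOD evaluation can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the data are synthetic, a least-squares line stands in for a GNN, and the "hold out the highest 20% of target values" split is a generic extrapolation split, not necessarily one of the paper's five OOD categories (the text above does not enumerate them).

```python
import numpy as np


def mae(y_true, y_pred):
    """Mean absolute error."""
    return float(np.mean(np.abs(y_true - y_pred)))


def evaluate(x_train, y_train, x_test, y_test):
    """Fit a least-squares line on the training set and return the test MAE."""
    slope, intercept = np.polyfit(x_train, y_train, deg=1)
    return mae(y_test, slope * x_test + intercept)


# Synthetic stand-in for a materials dataset (hypothetical):
# one descriptor x and a nonlinear target property y.
rng = np.random.default_rng(0)
x = rng.uniform(0.0, 3.0, size=500)
y = x**2 + rng.normal(0.0, 0.05, size=500)
n_test = len(x) // 5  # 20% of the samples

# Random split: test samples are drawn from the same range as training samples.
perm = rng.permutation(len(x))
rand_test, rand_train = perm[:n_test], perm[n_test:]
random_mae = evaluate(x[rand_train], y[rand_train], x[rand_test], y[rand_test])

# OOD-style split (assumed for illustration): hold out the samples with the
# highest target values, so the model has to extrapolate beyond its training range.
order = np.argsort(y)
ood_train, ood_test = order[:-n_test], order[-n_test:]
ood_mae = evaluate(x[ood_train], y[ood_train], x[ood_test], y[ood_test])

print(f"random-split MAE: {random_mae:.3f}")
print(f"OOD-split MAE:    {ood_mae:.3f}")
```

In this sketch the same model scores noticeably worse on the held-out high-value samples than on the random test set, which mirrors, in a much simplified form, the generalization gap the benchmark is designed to expose.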