Compositional design of compounds with elements not in training data using supervised learning

Jingjin He, Ruowei Yin, Changxin Wang, Chuanbao Liu, Dezhen Xue, Yanjing Su, Lijie Qiao, Turab Lookman, Yang Bai
DOI: https://doi.org/10.1016/j.jmat.2024.06.008
IF: 8.589
2024-07-15
Journal of Materiomics
Highlights:
• Machine learning accelerates the compositional design of compounds containing elements that are absent from the training data.
• The machine learning model predicts accurately only if the feature values of the unknown element fall within the range of the training set values.
• Prediction error rises with the Euclidean distance from a testing sample to its nearest training sample in feature space.

Abstract: An issue of current interest in the use of machine learning models to predict compositions of materials is their reliability in predicting outcomes for elements not included in the training data. We show that the phase diagram of the ceramic (Ba_{1-x-y}Ca_xSr_y)(Ti_{1-u-v-w}Zr_uSn_vHf_w)O_3 can be accurately predicted if the feature values of unknown elements do not exceed the range of values for existing elements in the training data. In particular, we employ physical features as descriptors and compositions as weights. By excluding an element such as Zr, Sn or Hf from the training set and treating it as an unknown element, we show that the machine learning model accurately predicts the property only if the feature values of the unknown element do not exceed the range of values of the existing elements in the training set. Adding a small amount of data for the unknown element restores the prediction accuracy. We demonstrate this for BaTiO_3 ceramics doped with rare earth elements, where the prediction accuracy is restored if the physical feature space is suitably enlarged with training data. The prediction error increases with the Euclidean distance from the testing sample to the nearest training sample in the physical feature space. Our work provides an effective strategy for extending machine learning models for material compositions beyond the scope of available data.
materials science, multidisciplinary; physics, applied; chemistry, physical
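The abstract names three ingredients: composition-weighted physical features, a check that an unknown element's features stay within the training range, and the Euclidean distance from a test sample to its nearest training sample. The Python sketch below illustrates how these could fit together. It is not the authors' implementation: the element labels (`A`, `B`, `C`, `U_in`, `U_out`), the two-feature table, its numerical values, and all function names are hypothetical placeholders chosen for illustration.

```python
import numpy as np

# Placeholder feature table: labels and values are illustrative assumptions,
# not data from the paper. Each entry holds two hypothetical physical features.
FEATURES = {
    "A": np.array([1.0, 2.0]),
    "B": np.array([2.0, 4.0]),
    "C": np.array([3.0, 3.0]),
    "U_in": np.array([2.5, 3.5]),   # "unknown" element inside the training range
    "U_out": np.array([4.0, 3.0]),  # "unknown" element outside the training range
}


def composition_features(fractions, table=FEATURES):
    """Descriptor of a compound: element features weighted by composition."""
    return sum(frac * table[el] for el, frac in fractions.items())


def within_training_range(element, training_elements, table=FEATURES):
    """True if every feature of `element` lies inside the per-feature
    [min, max] interval spanned by the training elements."""
    train = np.array([table[el] for el in training_elements])
    x = table[element]
    return bool(np.all((x >= train.min(axis=0)) & (x <= train.max(axis=0))))


def nearest_training_distance(x, X_train):
    """Euclidean distance from sample x to its nearest training sample."""
    diffs = np.asarray(X_train, dtype=float) - np.asarray(x, dtype=float)
    return float(np.min(np.linalg.norm(diffs, axis=1)))


# Usage: training compounds contain only A and B; the test compound mixes in U_out.
X_train = np.array([composition_features({"A": 1 - x, "B": x}) for x in (0.0, 0.5, 1.0)])
x_test = composition_features({"A": 0.5, "U_out": 0.5})
print(within_training_range("U_out", ["A", "B", "C"]))
print(nearest_training_distance(x_test, X_train))
```

Following the abstract, a failed range check or a large nearest-neighbour distance would flag a prediction as less trustworthy. In practice the features would usually be scaled before computing distances, which this sketch omits.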
What problem does this paper attempt to address?