Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models

Nihang Fu, Jeffrey Hu, Ying Feng, Gregory Morrison, Hans‐Conrad zur Loye, Jianjun Hu
DOI: https://doi.org/10.1002/advs.202301011
IF: 15.1
2023-08-09
Advanced Science
Abstract: A deep learning neural language model is used to learn the relationships between materials compositions and elemental oxidation states, and this learned knowledge is exploited to accurately predict atomic site oxidation states for materials given only their compositions. This model can be used to speed up high‐throughput generative discovery of new materials. Oxidation states (OS) are the charges on atoms due to electrons gained or lost upon applying an ionic approximation to their bonds. As a fundamental property, OS has been widely used in charge‐neutrality verification, crystal structure determination, and reaction estimation. Currently, only heuristic rules, which have many exceptions, exist for guessing the oxidation states of a given compound. Recent work has developed machine learning models based on heuristic structural features for predicting the oxidation states of metal ions. However, composition‐based oxidation state prediction remains elusive, even though it has significant implications for the discovery of new materials whose structures have not been determined. This work proposes BERTOS, a novel deep learning‐based BERT transformer language model for predicting the oxidation states of all elements of inorganic compounds given only their chemical composition. The model achieves 96.82% accuracy for all‐element oxidation state prediction benchmarked on the cleaned ICSD dataset and 97.61% accuracy for oxide materials. It is also demonstrated how the model can be used to conduct large‐scale screening of hypothetical material compositions for materials discovery.
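The charge‐neutrality verification mentioned in the abstract can be illustrated with a minimal sketch. This is not the BERTOS model or its API. The function names (`parse_formula`, `net_charge`, `is_charge_neutral`) are hypothetical, the parser handles only flat formulas without parentheses, and one oxidation state per element is assumed, so mixed‐valence compounds are out of scope. It only shows how a predicted oxidation state assignment could be checked against a composition.

```python
import re
from collections import Counter


def parse_formula(formula: str) -> Counter:
    """Parse a flat formula such as 'SrTiO3' into element counts.

    Assumption: no parentheses, hydrates, or fractional subscripts.
    """
    counts = Counter()
    for symbol, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[symbol] += int(number) if number else 1
    return counts


def net_charge(formula: str, oxidation_states: dict) -> int:
    """Sum count * oxidation state over all elements in the formula.

    Assumption: a single oxidation state per element.
    """
    counts = parse_formula(formula)
    return sum(n * oxidation_states[el] for el, n in counts.items())


def is_charge_neutral(formula: str, oxidation_states: dict) -> bool:
    """Return True when the assigned oxidation states sum to zero."""
    return net_charge(formula, oxidation_states) == 0


if __name__ == "__main__":
    # Sr(+2) + Ti(+4) + 3 * O(-2) = 0, so this assignment is neutral.
    print(is_charge_neutral("SrTiO3", {"Sr": 2, "Ti": 4, "O": -2}))
    # 2 * Fe(+2) + 3 * O(-2) = -2, so this assignment is rejected.
    print(is_charge_neutral("Fe2O3", {"Fe": 2, "O": -2}))
```

In a screening setting, a check like this would act as a filter: a hypothetical composition is kept only if the oxidation states assigned to its elements balance to zero.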
materials science, multidisciplinary, nanoscience & nanotechnology, chemistry
What problem does this paper attempt to address?