Predicting Synthesizability using Machine Learning on Databases of Existing Inorganic Materials

Ruiming Zhu,Siyu Isaac Parker Tian,Zekun Ren,Jiali Li,Tonio Buonassisi,Kedar Hippalgaonkar
DOI: https://doi.org/10.1021/acsomega.2c04856
IF: 4.1
2023-03-14
ACS Omega
Abstract:Defining the metric for synthesizability and predicting new compounds that can be experimentally realized in the realm of data-driven research is a pressing problem in contemporary materials science. The increasing computational power and advancements in machine learning (ML) algorithms provide a new avenue to solve the synthesizability challenge. In this work, using the Inorganic Crystal Structure Database (ICSD) and the Materials Project (MP) database, we represent crystal structures in...
chemistry, multidisciplinary
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to solve an urgent problem in the field of materials science: defining a metric for synthetic feasibility and predicting which new materials can be synthesized experimentally. Specifically, using machine - learning techniques and based on the existing inorganic materials database, the paper proposes a method that can predict the synthetic feasibility of materials. ### Background and challenges In contemporary materials science, being able to predict whether a material can be synthesized experimentally is an important task. This is related not only to the structure, phase state, and composition of the material, but also involves the bridge between theoretical research and experimental synthesis. Traditionally, researchers use the trial - and - error method to find material candidates, which usually combines empirical knowledge and theoretical calculations. However, due to the large differences in human experience and material selection, the discovery and synthesis process of new materials is often slow and unpredictable. Therefore, an accurate method is needed to determine the synthetic feasibility of materials to improve the efficiency and productivity of new material discovery. ### Solutions To meet this challenge, the authors use the Inorganic Crystal Structure Database (ICSD) and the Materials Project Database (MP), and propose a new crystal structure representation method - Fourier - Transform Crystal Properties (FTCP) representation, and use a deep - learning model to predict the synthetic feasibility of materials. The specific steps are as follows: 1. **Data preparation**: Obtain training and validation data sets from the ICSD and MP databases. 2. **Crystal structure representation**: Convert the crystal structure into a computer - readable form, including real - space features and reciprocal - space features. 3. **Model construction**: Build a deep - learning model that processes the crystal structure representation and outputs a Synthetic Feasibility Score (SC). 4. **Model training and validation**: Use data before 2015 for training and test on data after 2015 to evaluate the performance of the model. ### Main contributions - **High - precision prediction**: The model achieves an accuracy rate of 82.6% and a recall rate of 80.6% in predicting the synthetic feasibility of ternary crystal materials. - **Time - dependent validation**: By training and testing data from different time periods, the effectiveness of the model in predicting the synthetic feasibility of newly added materials is verified. - **Prediction of unexplored materials**: Provide a list of 100 new materials predicted to have high synthetic potential that have not been experimentally verified in the MP database. ### Conclusion By combining crystal structure representation and deep - learning techniques, this paper successfully establishes a model that can predict the synthetic feasibility of materials efficiently and accurately. This method not only helps to accelerate the discovery of new materials, but also provides strong support for future material screening and discovery.