Enhancing interpretability in the exploration of high-energy conversion efficiency in CsSnBr3−xIx configurations using crystal graph convolutional neural networks and adversarial example methods

Tao Wang, Xiaolong Lai, Yadong Wei, Hong Guo, Hao Jin
DOI: https://doi.org/10.1007/s40843-023-2800-x
2024-03-14
Science China Materials
Abstract: Crystal graph convolutional neural networks (CGCNNs) have revolutionized materials research by eliminating the need for manual feature engineering. However, their lack of interpretability and sensitivity to structural distortions hinders their application in substitution engineering. Therefore, we propose an innovative adversarial example method for guiding feature construction and enhancing the interpretability of CGCNNs. In this study, our focus lies on identifying CsSnBr3−xIx configurations with high energy conversion efficiency. Initially, we train a CGCNN classifier as a benchmark. Subsequently, we perturb input data to generate a low-performance classifier and identify adversarial examples based on incorrect predictions. Upon comparing these examples with normal examples, we observe substantial structural distortions in adversarial cases, serving as inspiration for the creation of disorder-related features. Consequently, an interpretable model is developed, which surpasses CGCNNs with atomic position perturbations and a gradient-boosting classifier using general features. Notably, the previously overlooked feature "number of unequal atoms" plays an important role in offering crucial insights. Further analysis reveals that configurations with pronounced disorder can exhibit increased power density, thereby enhancing the energy conversion efficiency. Our work not only elucidates the impact of atom substitution on energy conversion efficiency but also provides a roadmap for constructing interpretable machine learning models.
materials science, multidisciplinary
What problem does this paper attempt to address?
The paper primarily aims to address the following issues:

### Research Background and Objectives

- **Challenges in Material Screening and Design**: In the search for high-performance perovskite materials, traditional trial-and-error approaches (such as density functional theory (DFT) calculations and experiments) are time-consuming and costly.
- **Application of Machine Learning**: Machine learning can accelerate the discovery of new materials, in particular through crystal graph convolutional neural networks (CGCNNs), which predict properties directly from crystal structures.
- **Limitations of CGCNNs**: Although CGCNNs perform well in materials research, their lack of interpretability and their sensitivity to structural distortions limit their application in substitution engineering.

### Main Issues Addressed

- **Enhancing the Interpretability of CGCNNs**: Propose an adversarial example method that guides feature construction and improves the interpretability of CGCNNs.
- **Identifying High Energy Conversion Efficiency CsSnBr3−xIx Configurations**: Identify CsSnBr3−xIx configurations with high energy conversion efficiency, with a particular focus on the substitution of bromine (Br) with iodine (I).
- **Developing Interpretable Machine Learning Models**: Use the adversarial examples to build a machine learning model that outperforms the CGCNN with atomic position perturbations and is easier to understand.

### Method Overview

- **Training the CGCNN Baseline Model**: First, train a CGCNN classifier as a baseline.
- **Generating Adversarial Examples**: Slightly perturb the input data to obtain a lower-performing classifier, then identify adversarial examples from its incorrect predictions.
- **Feature Construction**: Compare adversarial examples with normal examples. The adversarial examples show substantially greater structural distortions, which motivates the construction of disorder-related features.
- **Developing an Interpretable Model**: Based on these features, develop an interpretable machine learning model that outperforms both the CGCNN with atomic position perturbations and a gradient boosting classifier (GBC) using general features.

### Results and Contributions

- **Key Feature**: The previously overlooked feature "number of unequal atoms" plays an important role in the model and offers crucial insights.
- **Performance Improvement**: The proposed interpretable model performs well in predicting the spectroscopic limited maximum efficiency (SLME) of CsSnBr3−xIx configurations.
- **Mechanism Revelation**: Configurations with pronounced disorder can exhibit higher power density, thereby improving energy conversion efficiency.

In summary, this research aims to improve substitution engineering strategies for discovering high-performance perovskite materials by enhancing the interpretability of machine learning models.
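
The adversarial example step in the method overview can be illustrated with a small sketch. It is not the authors' implementation: the function names are hypothetical, the selection rule (baseline correct, perturbed classifier wrong) is one plausible reading of "identify adversarial examples based on incorrect predictions", Gaussian noise on atomic positions is an assumed form of perturbation, and the mean atomic displacement is only a crude stand-in for the paper's disorder-related features.

```python
import math
import random
from typing import List, Sequence, Tuple

Position = Tuple[float, float, float]


def perturb_positions(positions: Sequence[Position], sigma: float,
                      seed: int = 0) -> List[Position]:
    """Add Gaussian noise (std = sigma) to every coordinate.

    Assumption: the source only says the input data are perturbed;
    Gaussian noise is chosen here purely for illustration.
    """
    rng = random.Random(seed)
    return [tuple(c + rng.gauss(0.0, sigma) for c in p) for p in positions]


def find_adversarial(labels: Sequence[int],
                     baseline_pred: Sequence[int],
                     perturbed_pred: Sequence[int]) -> List[int]:
    """Return indices the baseline classifies correctly but the
    perturbed (low-performance) classifier gets wrong.

    Assumption: this selection rule is an interpretation, not a rule
    stated in the source.
    """
    return [
        i
        for i, (y, b, p) in enumerate(zip(labels, baseline_pred, perturbed_pred))
        if b == y and p != y
    ]


def mean_displacement(reference: Sequence[Position],
                      distorted: Sequence[Position]) -> float:
    """Average Euclidean distance between corresponding atoms.

    Used here as a simple, hypothetical proxy for structural distortion.
    """
    return sum(math.dist(r, d) for r, d in zip(reference, distorted)) / len(reference)


if __name__ == "__main__":
    labels = [1, 0, 1, 1, 0]          # 1 = high-efficiency class (toy data)
    baseline_pred = [1, 0, 1, 0, 0]   # toy baseline predictions
    perturbed_pred = [1, 1, 0, 0, 0]  # toy predictions after perturbation
    print(find_adversarial(labels, baseline_pred, perturbed_pred))

    ideal = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    relaxed = [(0.0, 0.0, 0.0), (1.0, 3.0, 4.0)]
    print(mean_displacement(ideal, relaxed))
```

In such a workflow, the flagged indices would be compared against the remaining (normal) examples, for instance by contrasting their distortion measures, to suggest which structural descriptors are worth turning into explicit features.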