Accelerated Design for Perovskite-Oxide-Based Photocatalysts Using Machine Learning Techniques

Xiuyun Zhai,Mingtong Chen
DOI: https://doi.org/10.3390/ma17123026
IF: 3.4
2024-06-21
Materials
Abstract:The rapid discovery of photocatalysts with desired performance among tens of thousands of potential perovskites represents a significant advancement. To expedite the design of perovskite-oxide-based photocatalysts, we developed a model of ABO3-type perovskites using machine learning methods based on atomic and experimental parameters. This model can be used to predict specific surface area (SSA), a key parameter closely associated with photocatalytic activity. The model construction involved several steps, including data collection, feature selection, model construction, web-service development, virtual screening and mechanism elucidation. Statistical analysis revealed that the support vector regression model achieved a correlation coefficient of 0.9462 for the training set and 0.8786 for the leave-one-out cross-validation. The potential perovskites with higher SSA than the highest SSA observed in the existing dataset were identified using the model and our computation platform. We also developed a webserver of the model, freely accessible to users. The methodologies outlined in this study not only facilitate the discovery of new perovskites but also enable exploration of the correlations between the perovskite properties and the physicochemical features. These findings provide valuable insights for further research and applications of perovskites using machine learning techniques.
materials science, multidisciplinary,chemistry, physical,physics, applied, condensed matter,metallurgy & metallurgical engineering
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the issue of how to accelerate the design of perovskite oxide photocatalysts with the desired specific surface area (SSA). Specifically: 1. **Data Collection**: The authors collected a large amount of data on ABO₃-type perovskite materials synthesized via the sol-gel method from published literature. 2. **Feature Engineering**: By removing highly correlated features, the input feature set was optimized, ultimately retaining 23 features. 3. **Model Construction**: A model to predict the specific surface area of perovskite materials was established using the Support Vector Regression (SVR) algorithm, and hyperparameters were optimized through grid search and cross-validation. 4. **Web Service Development**: An online service platform was developed, allowing users to quickly and efficiently predict the specific surface area of ABO₃-type perovskite materials. 5. **Virtual Screening**: Potential ABO₃-type perovskite materials with high specific surface area were identified through virtual screening methods. 6. **Mechanism Exploration**: Key factors affecting the specific surface area of ABO₃-type perovskite materials were explored to provide guidance for material design. The main contributions of this study include: - Compiling a dataset of 99 ABO₃-type perovskite samples. - Developing a Support Vector Regression model based on the Radial Basis Function (RBF), which showed high accuracy on the training set (correlation coefficient R of 0.9462) and performed well in Leave-One-Out Cross-Validation (LOOCV) (correlation coefficient R of 0.8786). - Determining key factors affecting the specific surface area through forward and backward selection methods. - Identifying ABO₃-type perovskite materials with high potential for specific surface area. - Providing an online prediction service to facilitate quick and efficient specific surface area predictions for researchers.