A prediction model for CO2/CO adsorption performance on binary alloys based on machine learning

Xiaofeng Cao,Wenjia Luo,Huimin Liu
DOI: https://doi.org/10.1039/d4ra00710g
IF: 4.036
2024-04-17
RSC Advances
Abstract:Despite the rapid development of computational methods, including density functional theory (DFT), predicting the performance of a catalytic material merely based on its atomic arrangements remains challenging. Although quantum mechanics-based methods can model 'real' materials with dopants, grain boundaries, and interfaces with acceptable accuracy, the high demand for computational resources no longer meets the needs of modern scientific research. On the other hand, Machine Learning (ML) method can accelerate the screening of alloy-based catalytic materials. In this study, an ML model was developed to predict the CO 2 and CO adsorption affinity on single-atom doped binary alloys based on the thermochemical properties of component metals. By using a greedy algorithm, the best combination of features was determined, and the ML model was trained and verified based on a data set containing 78 alloys on which the adsorption energy values of CO 2 and CO were calculated from DFT. Comparison between predicted and DFT calculated adsorption energy values suggests that the extreme gradient boosting (XGBoost) algorithm has excellent generalization performance, and the R -squared ( R 2 ) for CO 2 and CO adsorption energy prediction are 0.96 and 0.91, respectively. The errors of predicted adsorption energy are 0.138 eV and 0.075 eV for CO 2 and CO, respectively. This model can be expected to advance our understanding of structure–property relationships at the fundamental level and be used in large-scale screening of alloy-based catalysts.
chemistry, multidisciplinary
What problem does this paper attempt to address?
The paper aims to address the following issues: 1. **Adsorption Energy Prediction Challenge**: Despite the rapid development of computational methods such as Density Functional Theory (DFT), predicting the performance of catalytic materials solely based on the atomic arrangement of materials remains challenging. Although quantum mechanical methods can simulate "real" materials (including dopants, grain boundaries, and interfaces) with acceptable accuracy, these methods no longer meet the demands of modern scientific research due to their high computational resource requirements. 2. **Machine Learning Accelerated Screening**: The paper utilizes machine learning (ML) methods to accelerate the screening process of alloy-based catalytic materials. Specifically, an ML model was developed to predict the adsorption affinity of carbon dioxide (CO₂) and carbon monoxide (CO) on single-atom doped binary alloys, and it was trained based on the thermochemical properties of the constituent metals. 3. **Feature Selection and Model Optimization**: By using a greedy algorithm to determine the optimal feature combination, the ML model was trained and validated on a dataset containing 78 alloys. Experimental results show that the Extreme Gradient Boosting (XGBoost) algorithm has excellent generalization performance, with coefficients of determination (R²) of 0.96 and 0.91 for the adsorption energy predictions of CO₂ and CO, respectively, and prediction errors of 0.138 eV and 0.075 eV, respectively. 4. **Understanding Structure-Performance Relationships**: The model is expected to advance our understanding of structure-performance relationships at a fundamental level and be used for large-scale screening of alloy-based catalysts. In summary, the goal of the paper is to improve the prediction capability of adsorption performance of catalytic materials through machine learning methods, thereby accelerating the discovery of novel alloy catalytic materials.