Two-Stage Feature Selection for Machine Learning-aided DFT-based Surface Reactivity Study on Single-Atom Alloys

Viejay Ordillo,Koji Shimizu,Darwin Barayang Putungan,Alexandra B. Santos-Putungan,Satoshi Watanabe,Rizalinda de Leon,Joey D Ocon,Karl Ezra Pilario,Allan Abraham Bustria Padama
DOI: https://doi.org/10.1088/1361-651x/ad53ee
IF: 2.421
2024-06-05
Modelling and Simulation in Materials Science and Engineering
Abstract:This paper presents a feature-centric strategy for predicting adsorption energies of key CO2 reduction reaction (CO2RR) adsorbates, CO and H species, utilizing DFT-based calculations for eight (8) adsorption sites and considering alloying effects of nine (9) transition metals at single-atom concentrations. Here, we explore a class of materials consisting of a majority host metal where individual atoms of a different element are dispersed called single-atom alloys (SAA). A total of eight (8) feature selection methods are assessed within Gradient Boosting Regression and Linear Regression models. This study proposes a practical and effective two-stage approach that narrows down the initial 86 features to subsets of 10 and 7 for CO and H adsorption energy predictions, respectively, with the arithmetic mean of valence electrons (VE-am) feature consistently emerging as highly influential, validated through permutation and Shapley additive explanations (SHAP)-based feature importance analyses. The models exhibit robust performance on unseen data, indicating their generalization capability. The findings emphasize VE-am as a potential key machine learning feature for CO2RR on SAA surfaces and underline the effectiveness of the feature-centric approach in understanding feature impacts in machine learning models for CO2RR on SAA systems. Additionally, while other features based on structural, electronic and elemental properties may not individually impact the model significantly, their collective contribution plays a vital role in achieving more accurate adsorption energy predictions.
materials science, multidisciplinary,physics, applied
What problem does this paper attempt to address?