Machine Learning Prediction of CO Adsorption Energies and Properties of Layered Alloys Using an Improved Feature Selection Algorithm

Tao-Tao Shi,Gao-Yong Liu,Zhao-Xu Chen
DOI: https://doi.org/10.1021/acs.jpcc.2c09020
2023-01-01
Abstract:Layered alloys are widely studiedas designable catalysts. As asurface probe molecule, CO adsorption energy is not only employedto characterize surface properties but also used as the catalyticactivity descriptor in various reactions. With the aid of high-throughputcomputing technology, we calculated CO adsorption energies on 3729layeredalloy surfaces. To obtain CO adsorption energies, the d-band centerand d-band skewness, and the stability of all the remaining layeredalloys (8415) of 23 transition metals, we collected 91 features thatdo not require time-consuming quantum chemistry calculations (non-QCfeatures) and 40 features from quantum chemistry calculations (QCfeatures). To reduce the feature dimension and overcome overfittingproblems, we proposed a modified sequential feature selection (SFS)wrapper method to identify (sub)-optimal subsets. Two supervised lightgradient boosting machine regression (LGBMR) machine learning (ML)regression models were established using the identified subsets. Itis demonstrated that the size of the feature subset converges rapidly,and the performance of the model with size nine is already quite satisfactory.The ML model of the non-QC features outperforms that of QC features.Using the ML models established with non-QC features, we predictedthe CO adsorption energies and the electronic structure-properties(d-band center, d-band skewness) and stability of 8415 layered alloys.Based on the four conditions (CO adsorption energy, stability, price,and surface segregation), potential alloy catalysts for CO2 to methanol were screened out of 12144 layered alloys.
What problem does this paper attempt to address?