ML‐driven models for predicting CO2 uptake in metal–organic frameworks (MOFs)

Sofiene Achour,Zied Hosni
DOI: https://doi.org/10.1002/cjce.25509
2024-10-03
The Canadian Journal of Chemical Engineering
Abstract:This study advances the discourse on the application of machine learning (ML) algorithms for the predictive analysis of CO2 uptake in metal–organic frameworks (MOFs), with a nuanced focus on the CATBoost model's capability to navigate the complexities inherent in MOFs' heterogeneous landscape. Building upon and extending the comparative analysis, our investigation underscores the CATBoost model's remarkable prediction robustness, characterized by a significant reduction in root mean square error (RMSE) and an enhanced R‐squared (R2) value, thereby affirming its superior accuracy and reliability in forecasting CO2 adsorption. A pivotal aspect of our research is the integration of SHapley Additive exPlanations (SHAP) values for a detailed assessment of feature importance, which not only corroborated 'pressure' and 'surface area' as pivotal determinants of CO2 uptake but also illuminated the model's advanced analytical capabilities in handling categorical features and mitigating overfitting, even within a dataset marked by intricate and non‐linear patterns. Our quantitative and conceptual analysis, showcasing up to a 15% improvement in RMSE over previous models, reveals the CATBoost model's unparalleled efficiency in discerning the multifaceted interplay of factors influencing CO2 adsorption. This is crucial for the strategic engineering of MOFs with optimized properties. Beyond 'pressure' and 'surface area', our SHAP analysis highlighted other descriptors with substantial values, elucidating their contributions to CO2 uptake and providing invaluable insights for the MOF design process.
engineering, chemical
What problem does this paper attempt to address?