The importance of reaction energy in predicting chemical reaction barriers with machine learning models

Joseph Gauthier, Aayush Singh, Nithin Lalith
DOI: https://doi.org/10.26434/chemrxiv-2023-t6zrj-v2
2024-03-01
Abstract: Improving our fundamental understanding of complex heterocatalytic processes increasingly relies on electronic structure simulations and microkinetic models based on calculated energy differences. In particular, calculation of activation barriers, usually achieved through compute-intensive saddle point search routines, remains a serious bottleneck in understanding trends in catalytic activity for highly branched reaction networks. Although the well-known Brønsted-Evans-Polanyi (BEP) scaling, a one-dimensional linear regression model, has been widely applied in such microkinetic models, it still relies on calculated reaction energies and may not generalize beyond a single facet on a single class of materials, e.g., terrace sites on transition metals. For highly branched and energetically shallow reaction networks, such as electrochemical CO2 reduction or waste remediation, calculating even reaction energies on many surfaces can become computationally intractable due to the combinatorial explosion of states that must be considered. Here, we investigate the feasibility of activation barrier prediction without knowledge of the reaction energy using linear and nonlinear machine learning (ML) models trained on a new database of over 500 dehydrogenation activation barriers. We find that inclusion of the reaction energy significantly improves both classes of ML models, but complex nonlinear models can achieve performance similar to the simplest BEP scaling when predicting activation barriers on new systems. Additionally, inclusion of the reaction energy significantly improves generalizability to new systems beyond the training set.
Our results suggest that the reaction energy is a critical feature to consider when building models to predict activation barriers, indicating that efforts to reliably predict reaction energies through, e.g., the Open Catalyst Project and others, will be an important route to effective model development for more complex systems.
Chemistry
What problem does this paper attempt to address?
This paper asks whether the activation barriers of chemical reactions can be predicted without first calculating reaction energies. Calculating activation barriers is currently a major bottleneck in understanding trends in catalytic activity, especially for highly branched reaction networks. The Brønsted-Evans-Polanyi (BEP) scaling, a one-dimensional linear regression model, is widely used in microkinetic models, but it still requires calculated reaction energies and may not generalize beyond a single facet on a single class of materials. For complex reaction networks such as electrochemical CO2 reduction or waste remediation, computing even the reaction energies on many surfaces can become computationally intractable because of the combinatorial explosion of states that must be considered. The study investigates whether activation barriers can be predicted without knowledge of the reaction energy, using linear and nonlinear machine learning (ML) models trained on a new database of over 500 dehydrogenation activation barriers. The results show that complex nonlinear models can achieve performance similar to the simplest BEP scaling when predicting barriers on new systems, but that including the reaction energy significantly improves both classes of ML models and their generalizability to systems beyond the training set. This suggests that the reaction energy is a critical feature when building models to predict activation barriers, and that efforts to reliably predict reaction energies (such as the Open Catalyst Project) will be an important route to effective models for more complex systems.
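For readers unfamiliar with BEP scaling, the "one-dimensional linear regression" referred to above amounts to fitting E_a ≈ alpha * dE + beta, where dE is the reaction energy and E_a the activation barrier. The following is a minimal sketch of such a fit, not the authors' code: the data are synthetic, and the slope, intercept, noise level, and the function names `fit_bep` and `predict_bep` are assumptions chosen purely for illustration.

```python
import random


def fit_bep(reaction_energies, barriers):
    """Ordinary least-squares fit of E_a = alpha * dE + beta (one feature)."""
    n = len(reaction_energies)
    mean_x = sum(reaction_energies) / n
    mean_y = sum(barriers) / n
    sxx = sum((x - mean_x) ** 2 for x in reaction_energies)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(reaction_energies, barriers))
    alpha = sxy / sxx
    beta = mean_y - alpha * mean_x
    return alpha, beta


def predict_bep(alpha, beta, reaction_energies):
    """Predict barriers from reaction energies with the fitted line."""
    return [alpha * x + beta for x in reaction_energies]


# Synthetic data only (assumed values in eV, NOT taken from the paper):
# a "true" slope of 0.6 and intercept of 0.9 with Gaussian noise.
rng = random.Random(0)
dE = [rng.uniform(-1.0, 1.5) for _ in range(200)]
Ea = [0.6 * x + 0.9 + rng.gauss(0.0, 0.1) for x in dE]

alpha, beta = fit_bep(dE, Ea)
predicted = predict_bep(alpha, beta, dE)
mae = sum(abs(p - y) for p, y in zip(predicted, Ea)) / len(Ea)
print(f"alpha={alpha:.3f}, beta={beta:.3f}, MAE={mae:.3f} eV")
```

The point of the sketch is that the only input to the model is the reaction energy itself, which is why a BEP-based workflow still needs a calculated (or separately predicted) reaction energy for every elementary step.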