Pathfinder -- Navigating and Analyzing Chemical Reaction Networks with an Efficient Graph-based Approach

Paul L. Türtscher,Markus Reiher
DOI: https://doi.org/10.1021/acs.jcim.2c01136
2022-12-16
Abstract:While the field of first-principles explorations into chemical reaction space has been continuously growing, the development of strategies for analyzing resulting chemical reaction networks (CRNs) is lagging behind. A CRN consists of compounds linked by reactions. Analyzing how these compounds are transformed into one another based on kinetic modeling is a nontrivial task. Here, we present the graph-optimization-driven algorithm and program Pathfinder to allow for such an analysis of a CRN. The CRN for this work has been obtained with our open-source Chemoton reaction network exploration software. Chemoton probes reactive combinations of compounds for elementary steps and sorts them into reactions. By encoding these reactions of the CRN as a graph consisting of compound and reaction vertices and adding information about activation barriers as well as required reagents to the edges of the graph yields a complete graph-theoretical representation of the CRN. Since the probabilities of the formation of compounds depend on the starting conditions, the consumption of any compound during a reaction must be accounted for to reflect the availability of reagents. To account for this, we introduce compound costs to reflect compound availability. Simultaneously, the determined compound costs rank the compounds in the CRN in terms of their probability to be formed. This ranking then allows us to probe easily accessible compounds in the CRN first for further explorations into yet unexplored terrain. We first illustrate the working principle on an abstract small CRN. Afterward, Pathfinder is demonstrated in the example of the disproportionation of iodine with water and the comproportionation of iodic acid and hydrogen iodide. Both processes are analyzed within the same CRN which we construct with our autonomous first-principles CRN exploration software Chemoton guided by Pathfinder.
Chemical Physics
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to address the challenges in the analysis of Chemical Reaction Networks (CRNs). Specifically: 1. **Complexity of Chemical Reaction Networks**: - Chemical reaction networks are formed by compounds connected through reactions. The high connectivity of these networks makes it extremely difficult to evaluate how specific compounds are formed. - A compound may be formed through multiple reaction paths, and each path consists of a series of reactions. Therefore, determining which path is most frequently followed is a complex problem. 2. **Limitations of Existing Methods**: - Although microkinetic modeling can solve this problem, when a large number of coupled ordinary differential equations need to be considered, the process of simulating the concentration flow is very time - consuming. - Existing graph - theory - based methods (such as Dijkstra's algorithm and Yen's algorithm) also have limitations when dealing with large - scale CRNs, especially when considering the stoichiometric requirements of reactions. 3. **The Proposed New Method**: - The authors propose an algorithm and program based on graph optimization - Pathfinder - for analyzing chemical reaction networks. - Pathfinder identifies the shortest path between two compounds by representing the CRN as a graph and encoding the kinetic and stoichiometric information of reactions in the graph. - This method not only considers the activation energy barrier of reactions but also the availability of reactants, thus predicting the formation path of compounds more accurately. ### Specific Problems - **How to Determine the Most Probable Reaction Path**: - By constructing a graph network, Pathfinder can identify the shortest path between two compounds. Here, "shortest" means that the total weight of the path is the minimum, and the weight includes the activation energy barrier of the reaction and the cost of reactants. - This method can be used to guide the exploration of CRNs. In particular, in reactions with high synthesis costs or high risks, it can predict whether the target product can be formed and the possible by - products without relying on kinetic modeling. - **How to Handle Stoichiometric Requirements**: - Pathfinder encodes the stoichiometric information required for reactions in the edges of the graph to ensure that the consumption of reactants is considered during the path - searching process. - For example, for the bimolecular reaction \(A + A\leftrightarrow B\), the edge in the graph will record that two equal amounts of \(A\) are required. - **How to Handle Initial Conditions**: - Pathfinder allows users to define initial conditions, that is, which compounds are available at the start of the reaction. These initial conditions will affect the cost of compounds, thereby affecting the choice of paths. - Through an iterative algorithm, Pathfinder can self - consistently determine the cost of each compound, reflecting its formation probability under the given initial conditions. In conclusion, by introducing the Pathfinder algorithm, this paper provides an efficient and comprehensive method for analyzing chemical reaction networks, solving the limitations of existing methods in dealing with complex CRNs.