Learning symmetry-aware atom mapping in chemical reactions through deep graph matching

Maryam Astero, Juho Rousu
DOI: https://doi.org/10.1186/s13321-024-00841-0
2024-04-24
Journal of Cheminformatics
Abstract: Accurate atom mapping, which establishes correspondences between atoms in reactants and products, is a crucial step in analyzing chemical reactions. In this paper, we present a novel end-to-end approach that formulates the atom mapping problem as a deep graph matching task. Our proposed model, AMNet (Atom Matching Network), utilizes molecular graph representations and employs various atom and bond features using graph neural networks to capture the intricate structural characteristics of molecules, ensuring precise atom correspondence predictions. Notably, AMNet incorporates the consideration of molecule symmetry, enhancing accuracy while simultaneously reducing computational complexity. The integration of the Weisfeiler-Lehman isomorphism test for symmetry identification refines the model's predictions. Furthermore, our model maps the entire atom set in a chemical reaction, offering a comprehensive approach beyond focusing solely on the main molecules in reactions. We evaluated AMNet's performance on a subset of USPTO reaction datasets, addressing various tasks, including assessing the impact of molecular symmetry identification, understanding the influence of feature selection on AMNet performance, and comparing its performance with the state-of-the-art method. The result reveals an average accuracy of 97.3% on mapped atoms, with 99.7% of reactions correctly mapped when the correct mapped atom is within the top 10 predicted atoms.
chemistry, multidisciplinary, computer science, interdisciplinary applications, information systems
What problem does this paper attempt to address?
The paper addresses atom mapping in chemical reactions: establishing which atom in the reactants corresponds to which atom in the products. It proposes an end-to-end approach that formulates this as a deep graph matching task. The main contributions are:

1. **An atom mapping model (AMNet) based on deep graph matching**: Molecules are represented as graphs, and graph neural networks (GNNs) process atom and bond features to predict atom correspondences.
2. **Symmetry-aware mapping**: The Weisfeiler-Lehman isomorphism test identifies molecular symmetry, which reduces the number of possible mappings and improves both the accuracy and the efficiency of atom mapping.
3. **Mapping of all atoms in the reaction**: Rather than focusing only on the main molecules, the model maps the entire atom set of the reactants and products.
4. **Reduced computational complexity**: Taking molecular symmetry into account lowers the computational cost of the mapping.

On a subset of the USPTO reaction datasets, AMNet achieves an average accuracy of 97.3% on mapped atoms, and 99.7% of reactions are correctly mapped when a prediction counts as correct if the true atom appears among the top 10 predicted atoms. The evaluation also examines the effect of symmetry identification and feature selection, and compares AMNet with the state-of-the-art method.
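To make the symmetry step more concrete, the following is a minimal sketch of Weisfeiler-Lehman style color refinement on a molecular graph. It is an illustration only, not AMNet's implementation: the function names `wl_symmetry_classes` and `_compress`, the input format (a list of element symbols plus `(i, j, bond_order)` tuples), and the choice of element symbol as the only initial atom feature are all assumptions made here. Atoms that end up with the same color are candidates for being symmetric; note that this kind of refinement can group atoms that are not truly equivalent in some graphs, so the classes are an over-approximation of real symmetry.

```python
from collections import defaultdict


def _compress(labels):
    """Map each distinct label to a small integer color (hypothetical helper)."""
    palette = {label: k for k, label in enumerate(sorted(set(labels)))}
    return [palette[label] for label in labels]


def wl_symmetry_classes(atoms, bonds):
    """Group atom indices into classes via WL color refinement.

    atoms: list of element symbols, e.g. ["C", "C", "O"]  (assumed format)
    bonds: list of (i, j, bond_order) tuples               (assumed format)
    Returns a sorted list of index lists, one per color class.
    """
    n = len(atoms)
    neighbors = defaultdict(list)
    for i, j, order in bonds:
        neighbors[i].append((j, order))
        neighbors[j].append((i, order))

    # Initial coloring: atoms of the same element share a color.
    colors = _compress(list(atoms))

    for _ in range(n):
        # New signature = own color + sorted multiset of (bond order, neighbor color).
        signatures = [
            (colors[i], tuple(sorted((order, colors[j]) for j, order in neighbors[i])))
            for i in range(n)
        ]
        new_colors = _compress(signatures)
        # Each round can only split classes, so an unchanged count means stability.
        if len(set(new_colors)) == len(set(colors)):
            break
        colors = new_colors

    classes = defaultdict(list)
    for index, color in enumerate(colors):
        classes[color].append(index)
    return sorted(classes.values())


# Heavy atoms of tert-butanol: central carbon 0, three methyl carbons, one oxygen.
tert_butanol = wl_symmetry_classes(
    ["C", "C", "C", "C", "O"],
    [(0, 1, 1), (0, 2, 1), (0, 3, 1), (0, 4, 1)],
)
print(tert_butanol)  # [[0], [1, 2, 3], [4]]
```

In this example the three methyl carbons fall into one class, so a mapping that sends a reactant atom to any of them could be treated as equivalent. That is one plausible way symmetry classes reduce the number of distinct mappings to consider; how AMNet uses them in detail is not specified in the summary above.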