Alignment is Key for Applying Diffusion Models to Retrosynthesis

Najwa Laabid,Severi Rissanen,Markus Heinonen,Arno Solin,Vikas Garg

2024-05-28

Abstract:Retrosynthesis, the task of identifying precursors for a given molecule, can be naturally framed as a conditional graph generation task. Diffusion models are a particularly promising modelling approach, enabling post-hoc conditioning and trading off quality for speed during generation. We show mathematically that permutation equivariant denoisers severely limit the expressiveness of graph diffusion models and thus their adaptation to retrosynthesis. To address this limitation, we relax the equivariance requirement such that it only applies to aligned permutations of the conditioning and the generated graphs obtained through atom mapping. Our new denoiser achieves the highest top-$1$ accuracy ($54.7$\%) across template-free and template-based methods on USPTO-50k. We also demonstrate the ability for flexible post-training conditioning and good sample quality with small diffusion step counts, highlighting the potential for interactive applications and additional controls for multi-step planning.

Machine Learning,Quantitative Methods

What problem does this paper attempt to address?

This paper mainly discusses how to effectively apply diffusion models in retrosynthesis tasks. Retrosynthesis is the process of determining the precursors of a given molecule, which is crucial for chemical synthesis planning. The study found that standard graph diffusion models have limitations in retrosynthesis due to the use of permutation equivariant denoisers, which restrict their expressive power in retrosynthesis. To address this issue, the paper proposes the concept of "aligned permutation equivariance". By leveraging atom mapping information, the permutations between reactants and products are aligned, maintaining equivariance only when atom mapping remains unchanged. This approach improves the accuracy of the model, and the newly proposed denoiser achieves the highest accuracy (54.7%) in both template-free and template-based methods on the USPTO-50k dataset. Furthermore, the paper demonstrates the model's ability to incorporate conditional information after training, such as completing and optimizing reaction properties, making the model suitable for interactive applications and multi-step planning. The paper emphasizes the importance of correctly applying diffusion models in graph conditional problems and proposes various alignment implementation methods that significantly improve the generation quality of directly applying graph diffusion models. In summary, this paper addresses the limitations of using standard graph diffusion models in retrosynthesis tasks and proposes a new aligned permutation equivariant denoiser, enhancing the flexibility, accuracy, and adaptability of the model.

Alignment is Key for Applying Diffusion Models to Retrosynthesis

RetroDiff: Retrosynthesis as Multi-stage Distribution Interpolation

RetroPrime: A Chemistry-Inspired and Transformer-based Method for Retrosynthesis Predictions

A Graph to Graphs Framework for Retrosynthesis Prediction

Mind the Retrosynthesis Gap: Bridging the divide between Single-step and Multi-step Retrosynthesis Prediction

Learning Graph Models for Retrosynthesis Prediction

Graph Diffusion Transformers for Multi-Conditional Molecular Generation

RetroPrime: A Diverse, Plausible and Transformer-based Method for Single-Step Retrosynthesis Predictions

Decomposing Retrosynthesis into Reactive Center Prediction and Molecule Generation

Retrosynthesis prediction using an end-to-end graph generative architecture for molecular graph editing

Dissecting Errors in Machine Learning for Retrosynthesis: A Granular Metric Framework and Transformer-Based Model for More Informative Predictions

Permutation invariant graph-to-sequence model for template-free retrosynthesis and reaction prediction

Retrosynthesis Prediction with an Iterative String Editing Model

G 2 Retro as a two-step graph generative models for retrosynthesis prediction

Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation

Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation

RetroXpert: Decompose Retrosynthesis Prediction like a Chemist

RetroBridge: Modeling Retrosynthesis with Markov Bridges

Assessing the Extrapolation Capability of Template-Free Retrosynthesis Models

Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis

Diffusing on Two Levels and Optimizing for Multiple Properties: A Novel Approach to Generating Molecules with Desirable Properties