Investigations into the efficiency of computer-aided synthesis planning

Peter B.R. Hartog,Annie M. Westerlund,Igor V. Tetko,Samuel Genheden

DOI: https://doi.org/10.26434/chemrxiv-2024-q2v87

2024-10-14

Abstract:Efficiency of machine learning (ML) models is crucial to minimize inference times and reduce carbon footprints of models deployed in production environments. Current models employed in retrosynthesis to generate a synthesis route from a target molecule to purchasable compounds are prohibitively slow. The model operates in single-step fashion in a tree search algorithm by predicting reactant molecules given a product molecule as input. In this study, we investigate the ability of alternative transformer architectures, knowledge distillation (KD) and simple hyper-parameter optimization to decrease inference times of the Chemformer model. Initially, we assess the ability of closely related transformer architectures and conclude that these models under-performed when using KD. Additionally, we investigate the effects of feature-based and response-based KD together with hyper-parameters optimized based on inference sample time and model accuracy. We find that although reducing model size and improving single-step speed are important, our results indicate that multi-step search efficiency is more significantly influenced by the diversity and confidence of single-step models. Based on this work, further research should use KD in combination with other techniques, as multi-step speed continues to prevent proper integration of synthesis planning. However, in Monte Carlo-based (MC) multi-step retrosynthesis, other factors play a crucial role in balancing exploration and exploitation during the search process, often outweighing the direct impact of single-step model speed and carbon footprints.

Chemistry

What problem does this paper attempt to address?

The problem this paper attempts to address is improving the efficiency of machine learning (ML) models in computer-aided synthesis planning tools. Specifically, the paper focuses on how to reduce the inference time of single-step reaction prediction models, thereby lowering the overall computational complexity and carbon footprint of multi-step synthesis route searches. Current models used for retrosynthetic analysis are very slow in generating synthesis routes from target molecules to purchasable compounds, which limits their application in real production environments. Therefore, the paper explores how to improve the inference speed of models while maintaining prediction accuracy by studying different Transformer architectures, knowledge distillation (KD) techniques, and hyperparameter optimization methods. Furthermore, it investigates whether these improvements can effectively translate into speed enhancements for multi-step retrosynthetic searches. The study finds that while reducing model size and improving single-step speed are important, the efficiency of multi-step searches is more influenced by the diversity and confidence of single-step models. Additionally, the paper emphasizes that in Monte Carlo-based multi-step retrosynthesis, the balance between exploration and exploitation is more crucial than the mere speed of single-step models.

Investigations into the efficiency of computer-aided synthesis planning

Learning to Plan Chemical Syntheses

Models Matter: The Impact of Single-Step Retrosynthesis on Synthesis Planning

Bridging Chemical Knowledge and Machine Learning for Performance Prediction of Organic Synthesis.

Do Chemformers Dream of Organic Matter? Evaluating a Transformer Model for Multistep Retrosynthesis

Challenging Complexity with Simplicity: Rethinking the Role of Single-Step Models in Computer-Aided Synthesis Planning

Dissecting Errors in Machine Learning for Retrosynthesis: A Granular Metric Framework and Transformer-Based Model for More Informative Predictions

Mind the Retrosynthesis Gap: Bridging the divide between Single-step and Multi-step Retrosynthesis Prediction

RetroPrime: A Chemistry-Inspired and Transformer-based Method for Retrosynthesis Predictions

Investigating the Reliability and Interpretability of Machine Learning Frameworks for Chemical Retrosynthesis

Prediction of Organic Reaction Outcomes Using Machine Learning

Chimera: Accurate retrosynthesis prediction by ensembling models with diverse inductive biases

RetroPrime: A Diverse, Plausible and Transformer-based Method for Single-Step Retrosynthesis Predictions

Planning chemical syntheses with deep neural networks and symbolic AI

Reagent prediction with a molecular transformer improves reaction data quality

Computer‐assisted synthetic planning considering reaction kinetics based on transition state automated generation method

Molecular Transformer - A Model for Uncertainty-Calibrated Chemical Reaction Prediction

Value-Added Chemical Discovery Using Reinforcement Learning

Evaluation of Reinforcement Learning in Transformer-based Molecular Design

A high-accuracy multi-model mixing retrosynthetic method

Machine Learning-Assisted Retrosynthesis Planning: Current Status and Future Prospects