Investigations into the efficiency of computer-aided synthesis planning

Peter B.R. Hartog,Annie M. Westerlund,Igor V. Tetko,Samuel Genheden
DOI: https://doi.org/10.26434/chemrxiv-2024-q2v87
2024-10-14
Abstract:Efficiency of machine learning (ML) models is crucial to minimize inference times and reduce carbon footprints of models deployed in production environments. Current models employed in retrosynthesis to generate a synthesis route from a target molecule to purchasable compounds are prohibitively slow. The model operates in single-step fashion in a tree search algorithm by predicting reactant molecules given a product molecule as input. In this study, we investigate the ability of alternative transformer architectures, knowledge distillation (KD) and simple hyper-parameter optimization to decrease inference times of the Chemformer model. Initially, we assess the ability of closely related transformer architectures and conclude that these models under-performed when using KD. Additionally, we investigate the effects of feature-based and response-based KD together with hyper-parameters optimized based on inference sample time and model accuracy. We find that although reducing model size and improving single-step speed are important, our results indicate that multi-step search efficiency is more significantly influenced by the diversity and confidence of single-step models. Based on this work, further research should use KD in combination with other techniques, as multi-step speed continues to prevent proper integration of synthesis planning. However, in Monte Carlo-based (MC) multi-step retrosynthesis, other factors play a crucial role in balancing exploration and exploitation during the search process, often outweighing the direct impact of single-step model speed and carbon footprints.
Chemistry
What problem does this paper attempt to address?
The problem this paper attempts to address is improving the efficiency of machine learning (ML) models in computer-aided synthesis planning tools. Specifically, the paper focuses on how to reduce the inference time of single-step reaction prediction models, thereby lowering the overall computational complexity and carbon footprint of multi-step synthesis route searches. Current models used for retrosynthetic analysis are very slow in generating synthesis routes from target molecules to purchasable compounds, which limits their application in real production environments. Therefore, the paper explores how to improve the inference speed of models while maintaining prediction accuracy by studying different Transformer architectures, knowledge distillation (KD) techniques, and hyperparameter optimization methods. Furthermore, it investigates whether these improvements can effectively translate into speed enhancements for multi-step retrosynthetic searches. The study finds that while reducing model size and improving single-step speed are important, the efficiency of multi-step searches is more influenced by the diversity and confidence of single-step models. Additionally, the paper emphasizes that in Monte Carlo-based multi-step retrosynthesis, the balance between exploration and exploitation is more crucial than the mere speed of single-step models.