Chemoenzymatic Synthesis Planning Guided by Reaction Type Score

Hongxiang Li,Xuan Liu,Guangde Jiang,Huimin Zhao
DOI: https://doi.org/10.1101/2024.10.25.620365
2024-10-29
Abstract:Thanks to the growing interests in computer-aided synthesis planning (CASP), a wide variety of retrosynthesis and retrobiosynthesis tools have been developed in the past decades. However, synthesis planning tools for multi-step chemoenzymatic reactions are still rare despite the widespread use of enzymatic reactions in chemical synthesis. Herein we report a reaction type score (RTscore)-guided chemoenzymatic synthesis planning (RTS-CESP) strategy. Briefly, the RTscore is trained using a text-based convolutional neural network (TextCNN) to distinguish synthesis reactions from decomposition reactions and evaluate synthesis efficiency. Once multiple chemical synthesis routes are generated by a retrosynthesis tool for a target molecule, RTscore is used to rank them and find the step(s) that can be replaced by enzymatic reactions to improve synthesis efficiency. As proof of concept, RTS-CESP was applied to 10 molecules with known chemoenzymatic synthesis routes in literature and was able to predict all of them with six being the top-ranked routes. Moreover, RTS-CESP was employed for 1000 molecules in the boutique database and was able to predict the chemoenzymatic synthesis routes for 554 molecules, outperforming ASKCOS, a state-of-the-art chemoenzymatic synthesis planning tool. Finally, RTS-CESP was used to design a new chemoenzymatic synthesis route for the FDA-approved drug Alclofenac, which was shorter than literature-reported route and have been experimentally validated.
Bioinformatics
What problem does this paper attempt to address?