Data-driven automatic synthesis planning: Synthesis routes of S-Zanubrutinib identified with CDI CASP

Alexei Lapkin, Zhen Guo, Akihiro Takada
DOI: https://doi.org/10.26434/chemrxiv-2024-f0kcq
2024-12-03
Abstract: Advances in computer-assisted synthesis planning (CASP) are revolutionising how new functional molecules are developed in many chemistry-using industries. CASP tools make it possible to assemble and analyse prior knowledge of a specified chemical system (a molecule, a reaction, a synthesis route) and to generate hypotheses for experimental campaigns that could be performed either manually or using automated reaction systems. Advanced CASP tools combine data science, chemoinformatics, machine learning and predictive tools based on physical models. Compared to expert-based synthesis planning, CASP techniques allow for faster and more comprehensive planning, which could significantly improve the efficiency of chemical process and product development. This White Paper describes a recent collaboration project between Shionogi & Co. Ltd. and Chemical Data Intelligence (CDI) Ltd. The CASP system developed by CDI (CDI-CASP) was tested in developing a new synthesis of S-Zanubrutinib, a drug for lymphoma treatment. Three types of search in CDI-CASP - “search synthesis routes”, “search analogue routes” and “search chiral reactions” - were iteratively applied for synthesis planning. Setting search criteria requires expert involvement. This ‘human in the middle’ interactive strategy leads to a shorter, greener, and more efficient synthesis route compared to the benchmark route filed in a patent.
Chemistry
What problem does this paper attempt to address?
The paper aims to develop a new synthetic route for S-Zanubrutinib using computer-assisted synthesis planning (CASP). Specifically, the research objectives are:

1. **Reduce the number of synthesis steps**: Find a shorter synthetic path than the existing patented route.
2. **Increase the overall yield**: Find a synthetic route with a higher overall yield.
3. **Avoid hazardous reagents, solvents and intermediates**: Design synthetic routes that do not use hazardous substances.
4. **Explore potential renewable raw materials**: Use renewable resources as raw materials to achieve a more environmentally friendly synthesis.

The paper describes how the CDI-CASP system was used to search for synthetic routes. The system can perform three types of search:

- **Synthesis route search**: Starting from the target molecule, search backwards for synthetic routes.
- **Analogue route search**: Propose new synthetic ideas based on similar chemical transformations in the literature.
- **Chiral reaction search**: Look for reported chiral synthesis reactions to support the synthesis design of chiral molecules.

Through these searches, the research team hopes to find a shorter, more efficient, safer and more environmentally friendly synthetic route for S-Zanubrutinib.
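
The first three objectives (step count, overall yield, hazard avoidance) lend themselves to a simple numerical comparison of candidate routes. The sketch below is only an illustration of that idea and is not CDI-CASP's scoring method: the `Step` and `Route` classes, the `rank_routes` helper, the ranking order and all yield values are assumptions invented for the example. It relies on one standard fact, that the overall yield of a linear sequence is the product of its step yields.

```python
from dataclasses import dataclass
from math import prod


@dataclass
class Step:
    step_yield: float        # fractional yield of this step, between 0 and 1
    hazardous: bool = False  # True if the step uses a flagged reagent or solvent


@dataclass
class Route:
    name: str
    steps: list[Step]

    @property
    def n_steps(self) -> int:
        return len(self.steps)

    @property
    def overall_yield(self) -> float:
        # Overall yield of a linear sequence is the product of the step yields.
        return prod(s.step_yield for s in self.steps)

    @property
    def n_hazardous(self) -> int:
        return sum(s.hazardous for s in self.steps)


def rank_routes(routes):
    # Assumed priority: fewer hazardous steps, then fewer steps,
    # then higher overall yield.
    return sorted(
        routes,
        key=lambda r: (r.n_hazardous, r.n_steps, -r.overall_yield),
    )


# Placeholder numbers, invented for illustration; not data from the paper.
benchmark = Route(
    "benchmark",
    [Step(0.8), Step(0.7, hazardous=True), Step(0.9), Step(0.5)],
)
candidate = Route("candidate", [Step(0.8), Step(0.75), Step(0.5)])

ranked = rank_routes([benchmark, candidate])
for route in ranked:
    print(route.name, route.n_steps, route.n_hazardous, round(route.overall_yield, 3))
```

A lexicographic sort key keeps the priorities explicit; a weighted score would be an equally reasonable choice if the objectives need to be traded off against each other. The fourth objective, renewable raw materials, is left out here because it depends on feedstock information that a yield table does not capture.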