Schedule Optimization for Chemical Library Synthesis

Qianxiang Ai, Fanwang Meng, Runzhong Wang, J. Cullen Klein, Alexander G. Godfrey, Connor W. Coley
DOI: https://doi.org/10.26434/chemrxiv-2024-0n73z
2024-11-12
Abstract: Automated chemistry platforms hold the potential to enable large-scale organic synthesis campaigns, such as producing a library of compounds for biological evaluation. The efficiency of such platforms will depend on the schedule according to which the synthesis operations are executed. In this work, we study the scheduling problem for chemical library synthesis, where operations from interdependent synthetic routes are scheduled to minimize the makespan, the total duration of the synthesis campaign. We formalize this problem as a flexible job-shop scheduling problem with chemistry-relevant constraints in the form of a mixed integer linear program (MILP), which we then solve in order to design an optimized schedule. The scheduler's ability to produce valid, optimal schedules is demonstrated by 720 simulated scheduling instances for realistically accessible chemical libraries. Reductions in makespan up to 73%, with an average reduction of 38%, are observed compared to the baseline scheduling approach.
Chemistry
What problem does this paper attempt to address?
This paper addresses the scheduling problem in chemical library synthesis: deciding when, and in what order, synthesis operations are executed so that the total duration (the makespan) of the synthesis campaign is minimized. On an automated chemistry platform that synthesizes a series of compounds for biological evaluation, the order and timing of operations determine efficiency. The paper therefore focuses on shortening the time needed to complete the whole library by scheduling the operations of interdependent synthetic routes.

### Specific Problem Description

1. **Background and Challenges**:
   - Automated chemistry platforms can synthesize compound libraries at large scale, for example a series of compounds for biological evaluation.
   - Synthesis efficiency depends on how the operations are scheduled, that is, on arranging them so that all tasks finish in the shortest time.
2. **Research Objectives**:
   - The paper studies the scheduling problem for chemical library synthesis, aiming to minimize the total synthesis time (makespan) through optimized scheduling.
   - The authors formalize the problem as a flexible job-shop scheduling problem with chemistry-relevant constraints, expressed as a mixed-integer linear program (MILP), and solve it to design an optimized schedule.
3. **Key Issues**:
   - Dependencies between operations: some operations can start only after others are completed.
   - Time-lag constraints: some pairs of operations must be separated by a minimum or maximum time interval.
   - Hardware module capacity limits: some devices can handle only a limited number of operations at the same time.
   - Work-shift limits: some operations can be carried out only during specific time periods.
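The constraint types listed under Key Issues can be written in a generic textbook form. The block below is a sketch of a standard flexible job-shop MILP, not the paper's actual formulation, and every symbol in it is an assumption introduced here: $s_i$ and $e_i$ are the start and end times of operation $i$, $M_i$ is its set of eligible hardware modules, $p_{im}$ is its duration on module $m$, $x_{im}$ is 1 if $i$ is assigned to $m$, $P$ is the set of precedence pairs, $\ell^{\min}_{ij}$ and $\ell^{\max}_{ij}$ are the minimum and maximum time lags, $y_{ij}$ is 1 if $i$ runs before $j$ on a shared module, and $B$ is a sufficiently large constant. For simplicity the sketch assumes each module handles one operation at a time and omits work-shift constraints.

```latex
\begin{align}
\min\ & C_{\max} \\
\text{s.t.}\ & \sum_{m \in M_i} x_{im} = 1 && \forall i \\
& e_i = s_i + \sum_{m \in M_i} p_{im}\, x_{im} && \forall i \\
& C_{\max} \ge e_i && \forall i \\
& s_j \ge e_i + \ell^{\min}_{ij} && \forall (i,j) \in P \\
& s_j \le e_i + \ell^{\max}_{ij} && \forall (i,j) \in P \text{ with a finite } \ell^{\max}_{ij} \\
& s_j \ge s_i + p_{im} - B\,(3 - x_{im} - x_{jm} - y_{ij}) && \forall i < j,\ m \in M_i \cap M_j \\
& s_i \ge s_j + p_{jm} - B\,(2 - x_{im} - x_{jm} + y_{ij}) && \forall i < j,\ m \in M_i \cap M_j \\
& x_{im},\ y_{ij} \in \{0, 1\}, \quad s_i \ge 0
\end{align}
```

The last two inequality families are active only when both operations are assigned to the same module: with $y_{ij} = 1$ the first forces $i$ to finish before $j$ starts, and with $y_{ij} = 0$ the second forces the opposite order.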
### Solution

The authors propose a MILP-based scheduling model that accounts for the constraints described above, so that the schedules it generates are valid. Across 720 simulated scheduling instances for realistically accessible chemical libraries, the model reduces the makespan by up to 73%, with an average reduction of 38%, relative to the baseline scheduling approach.

### Summary

The paper's main contribution is a systematic framework that solves the scheduling problem in chemical library synthesis by mathematical optimization, substantially shortening the simulated synthesis campaigns. This matters for research and industrial applications that need to synthesize large numbers of compounds quickly.
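To make the idea of a makespan reduction concrete, here is a minimal Python sketch on a hypothetical toy instance: two two-step routes that share two reactors and one purifier. None of the operation names, durations, or modules come from the paper. The baseline here is strictly sequential execution, which is an assumption (the paper's baseline is not described in this summary), and exhaustive enumeration stands in for a MILP solver, which is only feasible because the instance is tiny.

```python
from itertools import permutations, product

# Hypothetical toy instance (not from the paper):
# operation -> (duration in hours, eligible modules, predecessors)
OPS = {
    "A1": (4, ("reactor1", "reactor2"), ()),
    "A2": (2, ("purifier",), ("A1",)),
    "B1": (3, ("reactor1", "reactor2"), ()),
    "B2": (2, ("purifier",), ("B1",)),
}


def makespan(order, assignment):
    """Place operations in `order`, each at its earliest feasible start.

    Returns None if `order` lists an operation before one of its predecessors.
    """
    module_free = {}  # module -> time at which it becomes idle
    finish = {}       # operation -> finish time
    for op in order:
        duration, _, preds = OPS[op]
        if any(p not in finish for p in preds):
            return None
        module = assignment[op]
        start = max([module_free.get(module, 0)] + [finish[p] for p in preds])
        finish[op] = start + duration
        module_free[module] = finish[op]
    return max(finish.values())


def baseline_makespan():
    """Assumed baseline: run one operation at a time, so durations add up."""
    return sum(duration for duration, _, _ in OPS.values())


def optimal_makespan():
    """Enumerate every module assignment and operation order (toy sizes only)."""
    names = list(OPS)
    best = None
    for choice in product(*(OPS[name][1] for name in names)):
        assignment = dict(zip(names, choice))
        for order in permutations(names):
            value = makespan(order, assignment)
            if value is not None and (best is None or value < best):
                best = value
    return best


def reduction(baseline, optimized):
    """Fractional makespan reduction relative to the baseline."""
    return (baseline - optimized) / baseline


if __name__ == "__main__":
    base, opt = baseline_makespan(), optimal_makespan()
    print(f"baseline={base} h, optimized={opt} h, reduction={reduction(base, opt):.1%}")
```

On this toy instance the sequential baseline takes 11 hours and the best schedule takes 7 hours, a reduction of about 36%. That figure is an artifact of the made-up data and should not be compared with the results reported in the paper.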