RetroGNN: Fast Estimation of Synthesizability for Virtual Screening and De Novo Design by Learning from Slow Retrosynthesis Software
Cheng-Hao Liu,Maksym Korablyov,Stanisław Jastrzębski,Paweł Włodarczyk-Pruszyński,Yoshua Bengio,Marwin Segler,Stanisław Jastrzębski,Paweł Włodarczyk-Pruszyński
DOI: https://doi.org/10.1021/acs.jcim.1c01476
IF: 6.162
2022-04-22
Journal of Chemical Information and Modeling
Abstract:De novo molecule design algorithms often result in chemically unfeasible or synthetically inaccessible molecules. A natural idea to mitigate this problem is to bias these algorithms toward more easily synthesizable molecules using a proxy score for synthetic accessibility. However, using currently available proxies can still result in highly unrealistic compounds. Here, we propose a novel approach, RetroGNN, to estimate synthesizability. First, we search for routes using synthesis planning software for a large number of random molecules. This information is then used to train a graph neural network to predict the outcome of the synthesis planner given the target molecule, in which the regression task can be used as a synthesizability scorer. We highlight how RetroGNN can be used in generative molecule-discovery pipelines together with other scoring functions. We evaluate our approach on several QSAR-based molecule design benchmarks, for which we find synthesizable molecules with state-of-the-art scores. Compared to the virtual screening of 5 million existing molecules from the ZINC database, using RetroGNNScore with a simple fragment-based de novo design algorithm finds molecules predicted to be more likely to possess the desired activity exponentially faster, while maintaining good druglike properties and being easier to synthesize. Importantly, our deep neural network can successfully filter out hard to synthesize molecules while achieving a 10<sup>5</sup> times speedup over using retrosynthesis planning software.
chemistry, multidisciplinary, medicinal,computer science, interdisciplinary applications, information systems