Extracting medicinal chemistry intuition via preference machine learning

Oh-Hyeon Choung,Riccardo Vianello,Marwin Segler,Nikolaus Stiefl,José Jiménez-Luna
DOI: https://doi.org/10.1038/s41467-023-42242-1
IF: 16.6
2023-10-31
Nature Communications
Abstract:Abstract The lead optimization process in drug discovery campaigns is an arduous endeavour where the input of many medicinal chemists is weighed in order to reach a desired molecular property profile. Building the expertise to successfully drive such projects collaboratively is a very time-consuming process that typically spans many years within a chemist’s career. In this work we aim to replicate this process by applying artificial intelligence learning-to-rank techniques on feedback that was obtained from 35 chemists at Novartis over the course of several months. We exemplify the usefulness of the learned proxies in routine tasks such as compound prioritization, motif rationalization, and biased de novo drug design. Annotated response data is provided, and developed models and code made available through a permissive open-source license.
multidisciplinary sciences
What problem does this paper attempt to address?