Machine learning-aided generative molecular design
Yuanqi Du,Arian R. Jamasb,Jeff Guo,Tianfan Fu,Charles Harris,Yingheng Wang,Chenru Duan,Pietro Liò,Philippe Schwaller,Tom L. Blundell
DOI: https://doi.org/10.1038/s42256-024-00843-5
IF: 23.8
2024-06-19
Nature Machine Intelligence
Abstract:Machine learning has provided a means to accelerate early-stage drug discovery by combining molecule generation and filtering steps in a single architecture that leverages the experience and design preferences of medicinal chemists. However, designing machine learning models that can achieve this on the fly to the satisfaction of medicinal chemists remains a challenge owing to the enormous search space. Researchers have addressed de novo design of molecules by decomposing the problem into a series of tasks determined by design criteria. Here we provide a comprehensive overview of the current state of the art in molecular design using machine learning models as well as important design decisions, such as the choice of molecular representations, generative methods and optimization strategies. Subsequently, we present a collection of practical applications in which the reviewed methodologies have been experimentally validated, encompassing both academic and industrial efforts. Finally, we draw attention to the theoretical, computational and empirical challenges in deploying generative machine learning and highlight future opportunities to better align such approaches to achieve realistic drug discovery end points.
computer science, artificial intelligence, interdisciplinary applications