REINVENT4: Modern AI–Driven Generative Molecule Design

Hannes Loeffler,Jiazhen He,Alessandro Tibo,Jon Paul Janet,Alexey Voronov,Lewis Mervin,Ola Engkvist
DOI: https://doi.org/10.26434/chemrxiv-2023-xt65x
2023-11-06
Abstract:REINVENT4 is a modern open–source generative AI framework for the design of small molecules. The software utilizes recurrent neural networks and transformer architectures to drive molecule generation. These generators are seamlessly embedded within the general machine learning optimization algorithms transfer learning, reinforcement learning and curriculum learning. REINVENT4 enables and facilitates de novo design, R-group replacement, library design, linker design, scaffold hopping and molecule optimization. This contribution gives an overview of the software and describes its design. Algorithms and their applications are discussed in detail. REINVENT4 is a command line tool which reads a user configuration in either TOML or JSON format. The aim of this release is to provide reference implementations for some of the most common algorithms in AI based molecule generation. An additional goal with the release is to create a framework for education and future innovation in AI based molecular design. The software is available from https://github.com/ MolecularAI/REINVENT4 and released under the permissive Apache 2.0 license.
Chemistry
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the issue of generative artificial intelligence (Generative AI) in molecular design and specifically introduces REINVENT 4, a modern open-source generative AI framework. The main problems the paper attempts to solve are as follows: 1. **Design and Implementation of Molecular Generation Algorithms**: REINVENT 4 utilizes recurrent neural networks (RNN) and transformer architectures for molecular generation. These generators are seamlessly integrated into general machine learning optimization algorithms, including transfer learning, reinforcement learning, and curriculum learning. 2. **Support for Molecular Design Tasks**: REINVENT 4 supports various tasks such as de novo design, R-group replacement, library design, linker design, scaffold hopping, and molecule optimization. 3. **Promotion of Education and Innovation in Algorithms**: By providing reference implementations of some common generative molecular design algorithms, it promotes education and future innovation in generative AI within the field of drug discovery. The software is released under the Apache 2.0 license, aiming to provide an open-source framework to accelerate research progress in molecular design. 4. **Solving the Inverse Design Problem**: The paper discusses how to solve the inverse design problem using reinforcement learning techniques, i.e., generating new molecules based on desired properties. This involves optimizing the properties of existing molecules so that they meet specific criteria, such as target affinity, selectivity, physicochemical properties, etc. In summary, REINVENT 4 aims to provide a powerful and flexible tool to support AI-based molecular design work in drug discovery and other related fields.