Mothra: Multi-objective de novo Molecular Generation using Monte Carlo Tree Search

Masakazu Sekijima, Nobuaki Yasuo, Takamasa Suzuki, Dian Ma
DOI: https://doi.org/10.26434/chemrxiv-2024-4719t
2024-04-02
Abstract: In the field of drug discovery, identifying compounds that satisfy multiple criteria, such as target protein affinity, pharmacokinetics, and membrane permeability, is challenging because of the vast chemical space. Until now, multi-objective optimization using generative models has often involved linear combinations of different reward functions, turning multi-objective optimization into a single-objective task and causing problems with weighting for each individual objective. Herein we propose a scalable multi-objective molecular generative model developed using deep learning techniques. This model integrates the capabilities of recurrent neural networks for molecular generation and Pareto multi-objective Monte Carlo tree search to determine the optimal search direction. Through this integration, our model can generate compounds using enhanced evaluation functions that include important aspects like target protein affinity, drug similarity, and toxicity. The proposed model addresses the limitations of previous linear combination methods, and its effectiveness is demonstrated via extensive experimentation. The improvements achieved in the evaluation metrics underscore the potential utility of our approach toward drug discovery applications. In addition, we provide the source code for our model such that researchers can easily access and use our framework in their own investigations. The source code is available at https://github.com/sekijima-lab/Mothra.
Chemistry
What problem does this paper attempt to address?
This paper proposes Mothra, a method for multi-objective de novo molecular generation aimed at drug discovery. The traditional drug discovery process is time-consuming and expensive and has a low success rate, in part because candidate compounds must satisfy several criteria at once, such as target protein affinity, pharmacokinetics, and membrane permeability, and must be found within a vast chemical space.

Mothra combines a recurrent neural network (RNN) for molecular generation with Pareto multi-objective Monte Carlo tree search (MOMCTS), which determines the search direction. Generated compounds are scored with an evaluation function that covers target protein affinity, drug similarity, and toxicity. Earlier generative approaches often combined these rewards linearly, which turns the task into single-objective optimization and requires choosing a weight for each objective. Mothra sidesteps this weighting problem by using Pareto optimization instead of collapsing the criteria into a single score. The summary frames the underlying challenge as simultaneously optimizing multiple conflicting indicators, such as drug potency, safety, and production cost. By considering the Pareto front during the simulation and backpropagation steps of the tree search, Mothra can identify generated molecules that balance the different objectives.

According to the paper, experiments show that Mothra generated multi-objective optimized molecules for specific target proteins, and that it has an advantage over single-objective optimization methods in producing compounds that meet multiple requirements. The source code is available on GitHub for other researchers to use.
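
The difference between a linear combination of rewards and Pareto-based selection can be illustrated with a small sketch. This is not Mothra's implementation: the function names (`dominates`, `pareto_front`, `weighted_sum_best`) and the candidate scores are hypothetical, and all three objectives are assumed to be normalized so that higher is better. It only shows that a weighted sum returns one compound whose identity depends on the chosen weights, while the Pareto front keeps every non-dominated compound without requiring weights.

```python
from typing import List, Sequence, Tuple

# Hypothetical scores per candidate: (affinity, drug-likeness, safety).
# All objectives are assumed to be scaled so that higher is better.
Scores = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a is at least as good as b on every objective
    and strictly better on at least one (maximization)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(points: List[Scores]) -> List[Scores]:
    """Keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def weighted_sum_best(points: List[Scores], weights: Sequence[float]) -> Scores:
    """Linear scalarization: collapse the objectives into one score and take the maximum."""
    return max(points, key=lambda p: sum(w * x for w, x in zip(weights, p)))


# Illustrative values only, not results from the paper.
candidates: List[Scores] = [
    (0.9, 0.2, 0.5),
    (0.5, 0.8, 0.6),
    (0.4, 0.7, 0.5),  # worse than the second candidate on every objective
    (0.6, 0.6, 0.9),
]

front = pareto_front(candidates)
best_equal = weighted_sum_best(candidates, (1.0, 1.0, 1.0))
best_affinity = weighted_sum_best(candidates, (3.0, 1.0, 1.0))

print("Pareto front:", front)
print("Weighted sum, equal weights:", best_equal)
print("Weighted sum, affinity-heavy weights:", best_affinity)
```

In this toy example the two weight settings select different compounds, whereas the Pareto front retains all three non-dominated candidates and discards only the dominated one.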