Abstract:De novo molecular design is the process of learning knowledge from existing data to propose new chemical structures that satisfy the desired properties. By using de novo design to generate compounds in a directed manner, better solutions can be obtained in large chemical libraries with less comparison cost. But drug design needs to take multiple factors into consideration. For example, in polypharmacology, molecules that activate or inhibit multiple target proteins produce multiple pharmacological activities and are less susceptible to drug resistance. However, most existing molecular generation methods either focus only on affinity for a single target or fail to effectively balance the relationship between multiple targets, resulting in insufficient validity and desirability of the generated molecules. To address the problems, an approach called clustered Pareto-based reinforcement learning (CPRL) is proposed. In CPRL, a pre-trained model is constructed to grasp existing molecular knowledge in a supervised learning manner. In addition, the clustered Pareto optimization algorithm is presented to find the best solution between different objectives. The algorithm first extracts an update set from the sampled molecules through the designed aggregation-based molecular clustering. Then, the final reward is computed by constructing the Pareto frontier ranking of the molecules from the updated set. To explore the vast chemical space, a reinforcement learning agent is designed in CPRL that can be updated under the guidance of the final reward to balance multiple properties. Furthermore, to increase the internal diversity of the molecules, a fixed-parameter exploration model is used for sampling in conjunction with the agent. The experimental results demonstrate that CPRL is capable of balancing multiple properties of the molecule and has higher desirability and validity, reaching 0.9551 and 0.9923, respectively.

De Novo Drug Design by Multi-Objective Path Consistency Learning with Beam A∗ Search

De novo drug design by iterative multiobjective deep reinforcement learning with graph-based molecular quality assessment

Molecular generation strategy and optimization based on A2C reinforcement learning in de novo drug design

De novo drug design based on Stack-RNN with multi-objective reward-weighted sum and reinforcement learning

Generation of Rational Drug-Like Molecular Structures Through a Multiple-Objective Reinforcement Learning Framework

Multi-objective molecular generation via clustered Pareto-based reinforcement learning

Utilizing reinforcement learning for de novo drug design

Multi-Objective De Novo Drug Design with Conditional Graph Generative Model

De novo Drug Design using Reinforcement Learning with Multiple GPT Agents

Widely Used and Fast De Novo Drug Design by a Protein Sequence-Based Reinforcement Learning Model

De novo generation of dual-target ligands using adversarial training and reinforcement learning

LEADD: Lamarckian evolutionary algorithm for de novo drug design

Diversity-Aware Reinforcement Learning for de novo Drug Design

De novo drug design using reinforcement learning with graph-based deep generative models

Faster and more diverse de novo molecular optimization with double-loop reinforcement learning using augmented SMILES

Comprehensive assessment of deep generative architectures for de novo drug design

Status and Prospects of Research on Deep Learning-based De Novo Generation of Drug Molecules

Diversity oriented Deep Reinforcement Learning for targeted molecule generation

Human-in-the-loop assisted de novo molecular design

Guided Multi-objective Generative AI to Enhance Structure-based Drug Design