DrugGen: Advancing Drug Discovery with Large Language Models and Reinforcement Learning Feedback

Mahsa Sheikholeslami,Navid Mazrouei,Yousof Gheisari,Afshin Fasihi,Matin Irajpour,Ali Motahharynia

2024-11-20

Abstract:Traditional drug design faces significant challenges due to inherent chemical and biological complexities, often resulting in high failure rates in clinical trials. Deep learning advancements, particularly generative models, offer potential solutions to these challenges. One promising algorithm is DrugGPT, a transformer-based model, that generates small molecules for input protein sequences. Although promising, it generates both chemically valid and invalid structures and does not incorporate the features of approved drugs, resulting in time-consuming and inefficient drug discovery. To address these issues, we introduce DrugGen, an enhanced model based on the DrugGPT structure. DrugGen is fine-tuned on approved drug-target interactions and optimized with proximal policy optimization. By giving reward feedback from protein-ligand binding affinity prediction using pre-trained transformers (PLAPT) and a customized invalid structure assessor, DrugGen significantly improves performance. Evaluation across multiple targets demonstrated that DrugGen achieves 100% valid structure generation compared to 95.5% with DrugGPT and produced molecules with higher predicted binding affinities (7.22 [6.30-8.07]) compared to DrugGPT (5.81 [4.97-6.63]) while maintaining diversity and novelty. Docking simulations further validate its ability to generate molecules targeting binding sites effectively. For example, in the case of fatty acid-binding protein 5 (FABP5), DrugGen generated molecules with superior docking scores (FABP5/11, -9.537 and FABP5/5, -8.399) compared to the reference molecule (Palmitic acid, -6.177). Beyond lead compound generation, DrugGen also shows potential for drug repositioning and creating novel pharmacophores for existing targets. By producing high-quality small molecules, DrugGen provides a high-performance medium for advancing pharmaceutical research and drug discovery.

Quantitative Methods,Artificial Intelligence

What problem does this paper attempt to address?

### Problems the paper attempts to solve This paper aims to solve several key challenges in traditional drug design: 1. **High failure rate**: Traditional drug design methods have significant difficulties in dealing with chemical and biological complexity, leading to a high failure rate of candidate drugs in clinical trials. 2. **Generation of invalid structures**: Existing generative models (such as DrugGPT) can generate small molecules, but these molecules include both chemically valid and invalid structures, which leads to inefficiency and time - consuming in the drug discovery process. 3. **Lack of characteristics of approved drugs**: Existing models fail to fully consider the characteristics of approved drugs, which limits their ability to generate high - quality drug candidates. To solve these problems, the authors propose an enhanced generative model - DrugGen. DrugGen is based on the DrugGPT architecture and has been improved by the following methods: - **Supervised fine - tuning**: Fine - tuning is carried out using a dataset of approved drug - target pairs to improve the quality of the generated molecules. - **Reinforcement learning optimization**: The Proximal Policy Optimization (PPO) algorithm is used, combined with Protein - Ligand Binding Affinity Prediction (PLAPT) and a custom - made invalid structure evaluator, to further optimize the model performance. ### Main contributions 1. **Generate valid and diverse small molecules**: Evaluations of DrugGen on multiple targets show that it can generate 100% valid structures, and the generated molecules have a higher predicted binding affinity (7.22 [6.30 - 8.07]) compared to DrugGPT (5.81 [4.97 - 6.63]). 2. **Improve binding affinity**: Verified by molecular docking simulation, the molecules generated by DrugGen can target binding sites more effectively. For example, in the case of Fatty Acid - Binding Protein 5 (FABP5), the generated molecules have higher docking scores (FABP5/11, - 9.537 and FABP5/5, - 8.399) compared to the reference molecule (palmitic acid, - 6.177). 3. **Potential for drug re - positioning**: DrugGen not only performs well in generating new drug candidates, but also shows potential in drug re - positioning and creating new pharmacophores for existing targets. ### Conclusion As a powerful early - stage drug discovery tool, DrugGen is expected to significantly accelerate the identification process of new lead compounds. Through further optimization and experimental verification, DrugGen has the potential to become an important part of the future drug discovery pipeline and develop new drugs for the treatment of various diseases.

DrugGen: Advancing Drug Discovery with Large Language Models and Reinforcement Learning Feedback

De novo drug design as GPT language modeling: large chemistry models with supervised and reinforcement learning

TD-GPT: Target Protein-Specific Drug Molecule Generation GPT

MTMol-GPT: De novo multi-target molecular generation with transformer-based generative adversarial imitation learning

Guided Multi-objective Generative AI to Enhance Structure-based Drug Design

Optimizing Drug Design by Merging Generative AI With Active Learning Frameworks

Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design

TamGen: Drug Design with Target-Aware Molecule Generation Through a Chemical Language Model

Structure-Based de Novo Molecular Generator Combined with Artificial Intelligence and Docking Simulations

Target-aware Molecule Generation for Drug Design Using a Chemical Language Model

A Transformer-based Generative Model for De Novo Molecular Design

Generative artificial intelligence for small molecule drug design

Generative AI for Drug Discovery: A GPT-2 and LSTM Based Models for Designing EGFR Inhibitors

De Novo Drug Design Using Transformer-Based Machine Translation and Reinforcement Learning of an Adaptive Monte Carlo Tree Search

De novo Drug Design using Reinforcement Learning with Multiple GPT Agents

Accelerating Discovery of Novel and Bioactive Ligands With Pharmacophore-Informed Generative Models

Exploiting Pretrained Biochemical Language Models for Targeted Drug Design

Comprehensive assessment of deep generative architectures for de novo drug design

Fragment-Based Ligand Generation Guided By Geometric Deep Learning On Protein-Ligand Structure

Target Specific De Novo Design of Drug Candidate Molecules with Graph Transformer-based Generative Adversarial Networks

Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning