DrugGen: Advancing Drug Discovery with Large Language Models and Reinforcement Learning Feedback

Mahsa Sheikholeslami,Navid Mazrouei,Yousof Gheisari,Afshin Fasihi,Matin Irajpour,Ali Motahharynia
2024-11-20
Abstract:Traditional drug design faces significant challenges due to inherent chemical and biological complexities, often resulting in high failure rates in clinical trials. Deep learning advancements, particularly generative models, offer potential solutions to these challenges. One promising algorithm is DrugGPT, a transformer-based model, that generates small molecules for input protein sequences. Although promising, it generates both chemically valid and invalid structures and does not incorporate the features of approved drugs, resulting in time-consuming and inefficient drug discovery. To address these issues, we introduce DrugGen, an enhanced model based on the DrugGPT structure. DrugGen is fine-tuned on approved drug-target interactions and optimized with proximal policy optimization. By giving reward feedback from protein-ligand binding affinity prediction using pre-trained transformers (PLAPT) and a customized invalid structure assessor, DrugGen significantly improves performance. Evaluation across multiple targets demonstrated that DrugGen achieves 100% valid structure generation compared to 95.5% with DrugGPT and produced molecules with higher predicted binding affinities (7.22 [6.30-8.07]) compared to DrugGPT (5.81 [4.97-6.63]) while maintaining diversity and novelty. Docking simulations further validate its ability to generate molecules targeting binding sites effectively. For example, in the case of fatty acid-binding protein 5 (FABP5), DrugGen generated molecules with superior docking scores (FABP5/11, -9.537 and FABP5/5, -8.399) compared to the reference molecule (Palmitic acid, -6.177). Beyond lead compound generation, DrugGen also shows potential for drug repositioning and creating novel pharmacophores for existing targets. By producing high-quality small molecules, DrugGen provides a high-performance medium for advancing pharmaceutical research and drug discovery.
Quantitative Methods,Artificial Intelligence
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to solve several key challenges in traditional drug design: 1. **High failure rate**: Traditional drug design methods have significant difficulties in dealing with chemical and biological complexity, leading to a high failure rate of candidate drugs in clinical trials. 2. **Generation of invalid structures**: Existing generative models (such as DrugGPT) can generate small molecules, but these molecules include both chemically valid and invalid structures, which leads to inefficiency and time - consuming in the drug discovery process. 3. **Lack of characteristics of approved drugs**: Existing models fail to fully consider the characteristics of approved drugs, which limits their ability to generate high - quality drug candidates. To solve these problems, the authors propose an enhanced generative model - DrugGen. DrugGen is based on the DrugGPT architecture and has been improved by the following methods: - **Supervised fine - tuning**: Fine - tuning is carried out using a dataset of approved drug - target pairs to improve the quality of the generated molecules. - **Reinforcement learning optimization**: The Proximal Policy Optimization (PPO) algorithm is used, combined with Protein - Ligand Binding Affinity Prediction (PLAPT) and a custom - made invalid structure evaluator, to further optimize the model performance. ### Main contributions 1. **Generate valid and diverse small molecules**: Evaluations of DrugGen on multiple targets show that it can generate 100% valid structures, and the generated molecules have a higher predicted binding affinity (7.22 [6.30 - 8.07]) compared to DrugGPT (5.81 [4.97 - 6.63]). 2. **Improve binding affinity**: Verified by molecular docking simulation, the molecules generated by DrugGen can target binding sites more effectively. For example, in the case of Fatty Acid - Binding Protein 5 (FABP5), the generated molecules have higher docking scores (FABP5/11, - 9.537 and FABP5/5, - 8.399) compared to the reference molecule (palmitic acid, - 6.177). 3. **Potential for drug re - positioning**: DrugGen not only performs well in generating new drug candidates, but also shows potential in drug re - positioning and creating new pharmacophores for existing targets. ### Conclusion As a powerful early - stage drug discovery tool, DrugGen is expected to significantly accelerate the identification process of new lead compounds. Through further optimization and experimental verification, DrugGen has the potential to become an important part of the future drug discovery pipeline and develop new drugs for the treatment of various diseases.