An Exploration Strategy Improves the Diversity of De Novo Ligands Using Deep Reinforcement Learning: a Case for the Adenosine A2A Receptor

Xuhan Liu,Kai Ye,Herman W. T. van Vlijmen,Adriaan P. IJzerman,Gerard J. P. van Westen
DOI: https://doi.org/10.1186/s13321-019-0355-6
2019-01-01
Journal of Cheminformatics
Abstract:Over the last fiveyears deep learning has progressed tremendously in both image recognition andnatural language processing. Now it is increasingly applied to other data richfields. In drug discovery, recurrent neural networks (RNNs) have been shown tobe an effective method to generate novel chemical structures in the form ofSMILES. However, ligands generated by current methods have so far provided relatively lowdiversity and do not fully cover the whole chemicalspace occupied by known ligands. Here, we propose a new method (DrugEx) todiscover de novo drug-like molecules.DrugEx is an RNN model (generator) trained through reinforcement learning whichwas integrated with a special exploration strategy. As a case study we appliedour method to design ligands against the adenosine A2A receptor.From ChEMBL data, a machine learning model (predictor) was created to predictwhether generated molecules are active or not. Based on this predictor as thereward function, the generator was trained by reinforcement learning withoutany further data. We then compared the performance of our method with twopreviously published methods, REINVENT and ORGANIC. We found that candidatemolecules our model designed, and predicted to be active, had a larger chemicaldiversity, and better covered the chemical space of known ligands compared tothe state-of-the-art.
What problem does this paper attempt to address?