Searching for High-Value Molecules Using Reinforcement Learning and Transformers

Raj Ghugare, Santiago Miret, Adriana Hugessen, Mariano Phielipp, Glen Berseth
2023-10-04
Abstract: Reinforcement learning (RL) over text representations can be effective for finding high-value policies that can search over graphs. However, RL requires careful structuring of the search space and algorithm design to be effective in this challenge. Through extensive experiments, we explore how different design choices for text grammar and algorithmic choices for training can affect an RL policy's ability to generate molecules with desired properties. We arrive at a new RL-based molecular design algorithm (ChemRLformer) and perform a thorough analysis using 25 molecule design tasks, including computationally complex protein docking simulations. From this analysis, we discover unique insights in this problem space and show that ChemRLformer achieves state-of-the-art performance while being more straightforward than prior work by demystifying which design choices are actually helpful for text-based molecule design.
Machine Learning, Materials Science, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses the problem of finding molecules with high-value properties in molecular design. Specifically, it explores how to use reinforcement learning (RL) over text-based molecular representations (such as SMILES and SELFIES) to train models that generate molecules with specific chemical properties. The authors point out that although graph structures describe molecules very effectively, their large search space and complex action space make graph neural networks (GNNs) difficult to train in practical applications. Text-based methods, by contrast, may face challenges when constructing Markov decision processes (MDPs), but they show potential advantages in molecular design because they simplify the action space and can draw on large-scale text datasets. The paper's main contribution is a new text-based molecular design algorithm, ChemRLformer. By combining pre-training with carefully designed RL training choices, the algorithm achieves state-of-the-art performance on 25 molecular design tasks while remaining simple. In addition, through extensive experimental analysis, the paper examines how different text grammars and training algorithm choices affect an RL policy's ability to generate molecules with the desired properties, revealing unique insights in this problem space.
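To make the text-based MDP framing concrete, here is a minimal Python sketch of token-by-token generation as an episode: the state is the partial token sequence, each action appends one token, and a reward is assigned only to the finished string. This is an illustration, not ChemRLformer's implementation. The vocabulary `VOCAB`, the length cap `MAX_LEN`, the scoring function `toy_reward`, and the `uniform_policy` are all hypothetical stand-ins; a real setup would use a full SMILES or SELFIES vocabulary, a transformer policy, and a chemistry-based reward such as a docking score.

```python
import random

# Hypothetical toy vocabulary; real SMILES/SELFIES vocabularies are much larger.
VOCAB = ["C", "N", "O", "=", "(", ")", "<eos>"]
MAX_LEN = 12  # assumed cap on episode length


def step(state, action):
    """Append one token to the partial string.

    The episode ends when <eos> is emitted or MAX_LEN tokens are reached.
    """
    next_state = state + (action,)
    done = action == "<eos>" or len(next_state) >= MAX_LEN
    return next_state, done


def toy_reward(tokens):
    """Placeholder score: fraction of 'N' tokens in the string.

    A stand-in for a real property oracle; it does not check chemical validity.
    """
    body = [t for t in tokens if t != "<eos>"]
    return body.count("N") / len(body) if body else 0.0


def rollout(policy, rng):
    """Run one episode and return the token sequence with its terminal reward."""
    state, done = (), False
    while not done:
        action = policy(state, rng)
        state, done = step(state, action)
    return state, toy_reward(state)


def uniform_policy(state, rng):
    """Untrained baseline: ignores the state and picks any token uniformly."""
    return rng.choice(VOCAB)


if __name__ == "__main__":
    tokens, reward = rollout(uniform_policy, random.Random(0))
    print("".join(t for t in tokens if t != "<eos>"), reward)
```

The action space here is just the vocabulary, which is the simplification the text-based formulation offers over graph editing. The sketch also shows the cost: nothing stops the policy from emitting an invalid string, so in this formulation validity has to be handled by the grammar or by the reward.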