A novel molecule generative model of VAE combined with Transformer for unseen structure generation

Yasuhiro Yoshikai, Tadahaya Mizuno, Shumpei Nemoto, Hiroyuki Kusuhara
2024-04-05
Abstract: Recently, molecule generation using deep learning has been actively investigated in drug discovery. In this field, Transformer and VAE are widely used as powerful models, but they are rarely used in combination because of mismatches in their structure and performance. This study proposes a model that combines the two through structural and parameter optimization for handling diverse molecules. The proposed model performs comparably to existing models in generating molecules and far outperforms them in generating molecules with unseen structures. Another advantage of this VAE model is that it generates molecules from a latent representation, so molecular properties can easily be predicted from it or used to condition generation; indeed, we show that the latent representation of the model successfully predicts molecular properties. An ablation study suggested that VAE has an advantage over other generative models, such as language models, in generating novel molecules. It also indicated that the latent representation can be shortened to ~32-dimensional variables without loss of reconstruction, suggesting the possibility of a much smaller molecular descriptor or model than existing ones. This study is expected to provide a virtual chemical library containing a wide variety of compounds for virtual screening and to enable efficient screening.
Biomolecules, Machine Learning, Chemical Physics
What problem does this paper attempt to address?
The paper addresses molecule generation for drug discovery: how to use deep learning to generate novel small molecules with potential medicinal value. The research objectives are:

1. **Combining Variational Autoencoder (VAE) with Transformer Model**: VAE and Transformer are both powerful deep learning models that are widely used in molecule generation, but they are rarely combined because of mismatches in their structure and performance. The paper proposes a model that combines them by optimizing the structure and parameters.
2. **Generating Novel Molecular Structures**: The focus is on generating molecular structures that do not appear in the training dataset. The model therefore has to reach unexplored regions of chemical space, which is crucial for discovering new drugs.
3. **Maintaining Molecular Property Prediction Capability**: Because a VAE generates molecules from a latent representation, molecular properties can be predicted from that representation or used to condition generation. The paper shows that the latent representation of the model successfully predicts molecular properties.
4. **Optimizing Latent Space Dimensions**: The ablation study indicates that the latent representation can be shortened to about 32 dimensions without loss of reconstruction, suggesting that molecules can be represented with a much smaller descriptor or model than existing ones.

Overall, the model performs comparably to existing models in general molecule generation and far better in generating molecules with unseen structures. This matters for building virtual chemical libraries that contain a wide variety of compounds and for making virtual screening more efficient.
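To make the role of the latent representation more concrete, the sketch below shows the standard VAE bottleneck (reparameterized sampling and the KL term against a standard normal prior) for a 32-dimensional latent vector. It is a generic illustration, not the authors' implementation: this summary does not describe how the Transformer encoder and decoder are wired to the latent vector, so the encoder outputs `mu` and `logvar` are random stand-in values, and the names `LATENT_DIM`, `reparameterize`, and `kl_to_standard_normal` are hypothetical.

```python
import math
import random

LATENT_DIM = 32  # the ~32-dimensional latent size mentioned in the abstract


def reparameterize(mu, logvar, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), element-wise."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, logvar)]


def kl_to_standard_normal(mu, logvar):
    """KL(N(mu, diag(exp(logvar))) || N(0, I)), summed over latent dimensions."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv)
                      for m, lv in zip(mu, logvar))


rng = random.Random(0)

# Assumption: stand-in encoder outputs for one molecule (not real data).
# In a VAE-Transformer, an encoder would produce these from the input molecule.
mu = [rng.uniform(-1.0, 1.0) for _ in range(LATENT_DIM)]
logvar = [rng.uniform(-2.0, 0.0) for _ in range(LATENT_DIM)]

# Training path: sample a latent vector around the encoding and
# penalize its divergence from the prior.
z = reparameterize(mu, logvar, rng)
kl = kl_to_standard_normal(mu, logvar)

# Generation path: draw the latent vector from the prior instead of an encoding;
# a decoder (not shown) would then turn it into a molecule.
z_prior = [rng.gauss(0.0, 1.0) for _ in range(LATENT_DIM)]
```

In this setup the same fixed-length vector `z` serves both as the decoder's input and as a compact molecular descriptor, which is why a property predictor can be trained directly on it.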