3M-Diffusion: Latent Multi-Modal Diffusion for Language-Guided Molecular Structure Generation

Huaisheng Zhu,Teng Xiao,Vasant G Honavar
2024-10-03
Abstract:Generating molecular structures with desired properties is a critical task with broad applications in drug discovery and materials design. We propose 3M-Diffusion, a novel multi-modal molecular graph generation method, to generate diverse, ideally novel molecular structures with desired properties. 3M-Diffusion encodes molecular graphs into a graph latent space which it then aligns with the text space learned by encoder-based LLMs from textual descriptions. It then reconstructs the molecular structure and atomic attributes based on the given text descriptions using the molecule decoder. It then learns a probabilistic mapping from the text space to the latent molecular graph space using a diffusion model. The results of our extensive experiments on several datasets demonstrate that 3M-Diffusion can generate high-quality, novel and diverse molecular graphs that semantically match the textual description provided.
Machine Learning,Computation and Language,Biomolecules
What problem does this paper attempt to address?
This paper attempts to address the problem of generating high-quality, diverse novel molecular structures with desired properties given textual descriptions. Specifically, the paper proposes a new method called 3M-Diffusion, which is capable of generating diverse and novel molecular graphs from textual descriptions. Experimental results on multiple datasets demonstrate that this method can generate high-quality molecular graphs that semantically match the textual descriptions. The main contributions of 3M-Diffusion include: 1. Proposing a multimodal diffusion model for generating molecular structures based on textual descriptions, surpassing the limitations of existing methods. 2. Aligning the latent spaces of molecular graphs and textual descriptions to generate higher quality, more diverse, and novel molecular structures. 3. Outperforming existing state-of-the-art methods in four real-world text-based molecular graph generation benchmarks, achieving significant performance improvements on some datasets. For example, on the PCDes dataset, 3M-Diffusion improved novelty and diversity by 146.27% and 130.04% respectively compared to state-of-the-art methods, while maintaining semantic consistency with the text prompts.