Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry

Zeyu Wang,Tianyi Jiang,Jinhuan Wang,Qi Xuan
2024-01-09
Abstract:Molecular property prediction refers to the task of labeling molecules with some biochemical properties, playing a pivotal role in the drug discovery and design process. Recently, with the advancement of machine learning, deep learning-based molecular property prediction has emerged as a solution to the resource-intensive nature of traditional methods, garnering significant attention. Among them, molecular representation learning is the key factor for molecular property prediction performance. And there are lots of sequence-based, graph-based, and geometry-based methods that have been proposed. However, the majority of existing studies focus solely on one modality for learning molecular representations, failing to comprehensively capture molecular characteristics and information. In this paper, a novel multi-modal representation learning model, which integrates the sequence, graph, and geometry characteristics, is proposed for molecular property prediction, called SGGRL. Specifically, we design a fusion layer to fusion the representation of different modalities. Furthermore, to ensure consistency across modalities, SGGRL is trained to maximize the similarity of representations for the same molecule while minimizing similarity for different molecules. To verify the effectiveness of SGGRL, seven molecular datasets, and several baselines are used for evaluation and comparison. The experimental results demonstrate that SGGRL consistently outperforms the baselines in most cases. This further underscores the capability of SGGRL to comprehensively capture molecular information. Overall, the proposed SGGRL model showcases its potential to revolutionize molecular property prediction by leveraging multi-modal representation learning to extract diverse and comprehensive molecular insights. Our code is released at
Molecular Networks,Machine Learning,Biomolecules
What problem does this paper attempt to address?
The paper aims to address the problem of molecular property prediction, specifically how to predict the biochemical characteristics of molecules more accurately. Current methods primarily focus on single-modal representation learning, such as sequence, graph, or geometric methods, which fail to capture the comprehensive molecular information. The paper proposes a new multimodal representation learning model called SGGRL, which combines sequence, graph, and geometric features to improve the performance of molecular property prediction. By integrating representations from different modalities through fusion layers and ensuring consistency through contrastive learning, SGGRL outperforms baseline methods on multiple molecular datasets, indicating its ability to capture molecular information comprehensively.