Abstract:Molecular property prediction refers to the task of labeling molecules with some biochemical properties, playing a pivotal role in the drug discovery and design process. Recently, with the advancement of machine learning, deep learning-based molecular property prediction has emerged as a solution to the resource-intensive nature of traditional methods, garnering significant attention. Among them, molecular representation learning is the key factor for molecular property prediction performance. And there are lots of sequence-based, graph-based, and geometry-based methods that have been proposed. However, the majority of existing studies focus solely on one modality for learning molecular representations, failing to comprehensively capture molecular characteristics and information. In this paper, a novel multi-modal representation learning model, which integrates the sequence, graph, and geometry characteristics, is proposed for molecular property prediction, called SGGRL. Specifically, we design a fusion layer to fusion the representation of different modalities. Furthermore, to ensure consistency across modalities, SGGRL is trained to maximize the similarity of representations for the same molecule while minimizing similarity for different molecules. To verify the effectiveness of SGGRL, seven molecular datasets, and several baselines are used for evaluation and comparison. The experimental results demonstrate that SGGRL consistently outperforms the baselines in most cases. This further underscores the capability of SGGRL to comprehensively capture molecular information. Overall, the proposed SGGRL model showcases its potential to revolutionize molecular property prediction by leveraging multi-modal representation learning to extract diverse and comprehensive molecular insights. Our code is released at

In-Context Learning of Physical Properties: Few-Shot Adaptation to Out-of-Distribution Molecular Graphs

In-Context Learning for Few-Shot Molecular Property Prediction

Few-shot molecular property prediction via Hierarchically Structured Learning on Relation Graphs

Chemical Property Relation Guided Few-Shot Molecular Property Prediction

Few-shot learning with transformers via graph embeddings for molecular property prediction

Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular Property Prediction

Few-shot learning via graph embeddings with convolutional networks for low-data molecular property prediction

Property-Aware Relation Networks for Few-Shot Molecular Property Prediction

KPGT: Knowledge-Guided Pre-training of Graph Transformer for Molecular Property Prediction

MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction

Knowledge-enhanced Relation Graph and Task Sampling for Few-shot Molecular Property Prediction

Geometry-aware Line Graph Transformer Pre-training for Molecular Property Prediction

Molecular Property Prediction: A Multilevel Quantum Interactions Modeling Perspective

Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction

HimGNN: a novel hierarchical molecular graph representation learning framework for property prediction

Molecular Graph Transformer: Stepping Beyond ALIGNN Into Long-Range Interactions

Describe Molecules by a Heterogeneous Graph Neural Network with Transformer-like Attention for Supervised Property Predictions

Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry

Large property models: a new generative machine-learning formulation for molecules

Large Property Models: A New Generative Paradigm for Molecules