Abstract:Effective molecular representation learning is very important for Artificial Intelligence-driven Drug Design because it affects the accuracy and efficiency of molecular property prediction and other molecular modeling relevant tasks. However, previous molecular representation learning studies often suffer from limitations, such as over-reliance on a single molecular representation, failure to fully capture both local and global information in molecular structure, and ineffective integration of multiscale features from different molecular representations. These limitations restrict the complete and accurate representation of molecular structure and properties, ultimately impacting the accuracy of predicting molecular properties. To this end, we propose a novel multi-view molecular representation learning method called MvMRL, which can incorporate feature information from multiple molecular representations and capture both local and global information from different views well, thus improving molecular property prediction. Specifically, MvMRL consists of four parts: a multiscale CNN-SE Simplified Molecular Input Line Entry System (SMILES) learning component and a multiscale Graph Neural Network encoder to extract local feature information and global feature information from the SMILES view and the molecular graph view, respectively; a Multi-Layer Perceptron network to capture complex non-linear relationship features from the molecular fingerprint view; and a dual cross-attention component to fuse feature information on the multi-views deeply for predicting molecular properties. We evaluate the performance of MvMRL on 11 benchmark datasets, and experimental results show that MvMRL outperforms state-of-the-art methods, indicating its rationality and effectiveness in molecular property prediction. The source code of MvMRL was released in https://github.com/jedison-github/MvMRL.

Self-Supervised Graph Information Bottleneck for Multiview Molecular Embedding Learning

An Image-enhanced Molecular Graph Representation Learning Framework

Multi-View Graph Neural Networks for Molecular Property Prediction

Multiview Deep Graph Infomax to Achieve Unsupervised Graph Embedding

An effective self-supervised framework for learning expressive molecular global representations to drug discovery

Cross-dependent graph neural networks for molecular property prediction

Self-Supervised Molecular Representation Learning With Topology and Geometry

MultiModal-Learning for Predicting Molecular Properties: A Framework Based on Image and Graph Structures

Utilizing Edge Features in Graph Neural Networks Via Variational Information Maximization

AEGNN-M:A 3D Graph-Spatial Co-Representation Model for Molecular Property Prediction

A Knowledge-Driven Self-Supervised Approach for Molecular Generation

MvMRL: a multi-view molecular representation learning method for molecular property prediction

Molecular Property Prediction Based on Graph Structure Learning

Molecular Joint Representation Learning via Multi-modal Information of SMILES and Graphs

Molecular Graph Representation Learning via Structural Similarity Information

Graph neural network for 3-dimensional structures including dihedral angles for molecular property prediction

Triple Generative Self-Supervised Learning Method for Molecular Property Prediction

HiGNN: Hierarchical Informative Graph Neural Networks for Molecular Property Prediction Equipped with Feature-Wise Attention

Evaluating Self-Supervised Learning for Molecular Graph Embeddings

Pre-training Molecular Graph Representation with 3D Geometry

Molecular Joint Representation Learning via Multi-modal Information