Abstract:In the technical report, we provide our solution for OGB-LSC 2022 Graph Regression Task. The target of this task is to predict the quantum chemical property, HOMO-LUMO gap for a given molecule on PCQM4Mv2 dataset. In the competition, we designed two kinds of models: Transformer-M-ViSNet which is an geometry-enhanced graph neural network for fully connected molecular graphs and Pretrained-3D-ViSNet which is a pretrained ViSNet by distilling geomeotric information from optimized structures. With an ensemble of 22 models, ViSNet Team achieved the MAE of 0.0723 eV on the test-challenge set, dramatically reducing the error by 39.75% compared with the best method in the last year competition.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to predict the quantum - chemical properties of a given molecule, especially the HOMO - LUMO energy gap. Specifically, the authors designed two models for the OGB - LSC 2022 graph regression task to achieve this goal. These models aim to utilize the 2D topological structure and 3D geometric information of molecules to improve the prediction accuracy. The PCQM4Mv2 dataset is used to train and test these models, which provides optimized 3D molecular structures, but these structures are not provided in the validation set and the test set. Therefore, how to effectively utilize the 3D molecular structure and combine it with the 2D topological structure of the molecule has become the key challenge in solving this problem. To meet this challenge, the authors proposed the following solutions: 1. **Transformer - M - ViSNet**: This is an improved Transformer model that can better extract and utilize geometric information. Through powerful vector - scalar interaction operations, this model can handle fully - connected molecular graphs and use 3D attention bias in each Transformer layer to enhance the utilization of geometric information. 2. **Pretrained - 3D - ViSNet**: This is a pre - trained ViSNet model. By distilling the hidden representation of geometric information from the optimized structure and then applying it to the generated structure, the effective utilization of the 3D molecular structure is achieved. This method helps to reduce the impact of noise in the generated structure and improve the prediction performance of the model. By combining the prediction results of these two models, the ViSNet team achieved a mean absolute error (MAE) of 0.0723 eV in the test challenge set, with a 39.75% reduction in error compared to the best method in last year's competition. This shows that their method has achieved significant results in utilizing 3D molecular structures, which helps to narrow the gap between molecular topological structures and 3D geometric information, thus promoting the development of molecular modeling research.

An ensemble of VisNet, Transformer-M, and pretraining models for molecular property prediction in OGB Large-Scale Challenge @ NeurIPS 2022

ViSNet: an equivariant geometry-enhanced graph neural network with vector-scalar interactive message passing for molecules

3D-Transformer: Molecular Representation with Transformer in 3D Space

Enhancing geometric representations for molecules with equivariant vector-scalar interactive message passing

Dual-view Molecular Pre-training

ViSNet: a scalable and accurate geometric deep learning potential for molecular dynamics simulation

Pretraining Graph Transformer for Molecular Representation with Fusion of Multimodal Information

Dynamic Molecular Graph-based Implementation for Biophysical Properties Prediction

Directed graph attention neural network utilizing 3D coordinates for molecular property prediction

Pre-trained Molecular Language Models with Random Functional Group Masking

Quantum-Informed Molecular Representation Learning Enhancing ADMET Property Prediction

Pre-training molecular representation model with spatial geometry for property prediction

Autoencoding Undirected Molecular Graphs With Neural Networks

Unified 2D and 3D Pre-Training of Molecular Representations

Molecular Geometry-aware Transformer for accurate 3D Atomic System modeling

Ensemble Model With Bert,Roberta and Xlnet For Molecular property prediction

Synergistic Fusion of Graph and Transformer Features for Enhanced Molecular Property Prediction

Describe Molecules by a Heterogeneous Graph Neural Network with Transformer-like Attention for Supervised Property Predictions

Molecular Graph Enhanced Transformer for Retrosynthesis Prediction

GTAM: A Molecular Pretraining Model with Geometric Triangle Awareness

Automated 3D Pre-Training for Molecular Property Prediction