An ensemble of VisNet, Transformer-M, and pretraining models for molecular property prediction in OGB Large-Scale Challenge @ NeurIPS 2022

Yusong Wang, Shaoning Li, Zun Wang, Xinheng He, Bin Shao, Tie-Yan Liu, Tong Wang
2023-08-16
Abstract: In this technical report, we provide our solution for the OGB-LSC 2022 Graph Regression Task. The target of this task is to predict a quantum chemical property, the HOMO-LUMO gap, for a given molecule on the PCQM4Mv2 dataset. In the competition, we designed two kinds of models: Transformer-M-ViSNet, which is a geometry-enhanced graph neural network for fully connected molecular graphs, and Pretrained-3D-ViSNet, which is a ViSNet pretrained by distilling geometric information from optimized structures. With an ensemble of 22 models, the ViSNet Team achieved an MAE of 0.0723 eV on the test-challenge set, dramatically reducing the error by 39.75% compared with the best method in last year's competition.
Machine Learning, Chemical Physics
What problem does this paper attempt to address?
This paper addresses the prediction of quantum chemical properties of a given molecule, specifically the HOMO-LUMO energy gap. The authors designed two kinds of models for the OGB-LSC 2022 graph regression task, both of which aim to use the 2D topological structure and the 3D geometric information of molecules to improve prediction accuracy.

The models are trained and tested on the PCQM4Mv2 dataset, which provides optimized 3D molecular structures for training but not for the validation and test sets. The key challenge is therefore how to exploit 3D molecular structure effectively and combine it with the 2D topology of the molecule when optimized geometries are unavailable at inference time. To meet this challenge, the authors proposed the following solutions:

1. **Transformer-M-ViSNet**: An improved Transformer model that better extracts and uses geometric information. Through vector-scalar interaction operations, the model handles fully connected molecular graphs and applies a 3D attention bias in each Transformer layer to strengthen its use of geometric information.
2. **Pretrained-3D-ViSNet**: A pretrained ViSNet model. It distills the hidden representation of geometric information from the optimized structures and then applies it to generated structures, so that 3D molecular structure can still be used effectively. This helps reduce the impact of noise in the generated structures and improves prediction performance.

With an ensemble of 22 models of these two kinds, the ViSNet team achieved a mean absolute error (MAE) of 0.0723 eV on the test-challenge set, a 39.75% reduction in error compared with the best method in the previous year's competition.
This result indicates that the method makes effective use of 3D molecular structures, helping to narrow the gap between 2D molecular topology and 3D geometric information and thereby advancing molecular modeling research.
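
To make the reported numbers concrete, the following is a minimal Python sketch of how an ensemble prediction and its MAE can be computed, and of the arithmetic behind the quoted error reduction. It is not the authors' code: the unweighted mean is an assumption (the text does not say how the 22 models were combined), the function names `ensemble_mean`, `mean_absolute_error`, and `implied_baseline` are hypothetical, and the toy gap values are made up for illustration.

```python
import numpy as np


def ensemble_mean(predictions):
    """Average per-model predictions.

    predictions: array-like of shape (n_models, n_molecules).
    Assumption: a plain unweighted mean; the source does not state
    the actual combination rule.
    """
    return np.mean(np.asarray(predictions, dtype=float), axis=0)


def mean_absolute_error(y_true, y_pred):
    """MAE = mean(|y_true - y_pred|), the metric reported in eV."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean(np.abs(y_true - y_pred)))


def implied_baseline(mae, relative_reduction):
    """Baseline MAE implied by a reported MAE and a relative reduction."""
    return mae / (1.0 - relative_reduction)


# Toy HOMO-LUMO gaps in eV (made-up values, two hypothetical models).
y_true = np.array([5.0, 6.0, 4.0])
preds = np.array([
    [5.2, 6.1, 3.9],
    [4.8, 5.9, 4.3],
])

ensemble = ensemble_mean(preds)
mae_single = [mean_absolute_error(y_true, p) for p in preds]
mae_ensemble = mean_absolute_error(y_true, ensemble)

# Arithmetic on the reported figures: 0.0723 eV after a 39.75% reduction.
baseline = implied_baseline(0.0723, 0.3975)
```

In this toy case the two models err in opposite directions on some molecules, so the averaged prediction has a lower MAE than either model alone, which is the usual motivation for ensembling. Applying the same arithmetic to the reported figures, an MAE of 0.0723 eV after a 39.75% reduction corresponds to a previous best of about 0.1200 eV; that value is derived here and is not stated in the text.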