DropConn: Dropout Connection Based Random GNNs for Molecular Property Prediction

Dan Zhang,Wenzheng Feng,Yuandong Wang,Zhongang Qi,Ying Shan,Jie Tang
DOI: https://doi.org/10.1109/tkde.2023.3290032
IF: 9.235
2023-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Recently, molecular data mining has attracted a lot of attention owing to its great application potential in material and drug discovery. However, this mining task faces a challenge posed by the scarcity of labeled molecular graphs. To overcome this challenge, we introduce a novel data augmentation and a semi-supervised confidence-aware consistency regularization training framework for molecular property prediction. The core of our framework is a data augmentation strategy on molecular graphs, named DropConn (Dropout Connection). DropConn generates pseudo molecular graphs by softening the hard connections of chemical bonds (as edges), where the soft weights are calculated from edge features so that the adaptive interactions between different atoms can be incorporated. Besides, to enhance the model's generalization ability, a consistency regularization training strategy is proposed to take full advantage of massive unlabeled data. Furthermore, DropConn can serve as a plugin that can be seamlessly added to many existing models. Extensive experiments under both non-pre-training setting and fine-tuning setting demonstrate that DropConn can obtain superior performance (up to 8.22%) over state-of-the-art methods on molecular property prediction tasks.
What problem does this paper attempt to address?