Rethinking the Masking Strategy for Pretraining Molecular Graphs from a Data-Centric View

Wei Lin,Chi Chung Alan Fung
DOI: https://doi.org/10.1021/acsomega.3c09512
IF: 4.1
2024-05-21
ACS Omega
Abstract:Node-level self-supervised learning has been widely applied for pretraining molecular graphs. Attribute Masking (AttrMask) is pioneering work in this field, and its improved methods focus on enhancing the capacity of the backbone models by incorporating additional modules. However, these methods overlook the imbalanced atom distribution due to employing only the random masking strategy to mask atoms for pretraining. According to the properties of molecules, we propose a weighted masking strategy...
chemistry, multidisciplinary
What problem does this paper attempt to address?