ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance

Ling-Hao Chen,Yuanshuo Zhang,Taohua Huang,Liangcai Su,Zeyi Lin,Xi Xiao,Xiaobo Xia,Tongliang Liu
DOI: https://doi.org/10.1145/3627673.3679552
2024-01-01
Abstract:Deep learning has achieved remarkable success in graph-related tasks, yetthis accomplishment heavily relies on large-scale high-quality annotateddatasets. However, acquiring such datasets can be cost-prohibitive, leading tothe practical use of labels obtained from economically efficient sources suchas web searches and user tags. Unfortunately, these labels often come withnoise, compromising the generalization performance of deep networks. To tacklethis challenge and enhance the robustness of deep learning models against labelnoise in graph-based tasks, we propose a method called ERASE (Error-Resilientrepresentation learning on graphs for lAbel noiSe tolerancE). The core idea ofERASE is to learn representations with error tolerance by maximizing codingrate reduction. Particularly, we introduce a decoupled label propagation methodfor learning representations. Before training, noisy labels are pre-correctedthrough structural denoising. During training, ERASE combines prototypepseudo-labels with propagated denoised labels and updates representations witherror resilience, which significantly improves the generalization performancein node classification. The proposed method allows us to more effectivelywithstand errors caused by mislabeled nodes, thereby strengthening therobustness of deep networks in handling noisy graph data. Extensiveexperimental results show that our method can outperform multiple baselineswith clear margins in broad noise levels and enjoy great scalability. Codes arereleased at https://github.com/eraseai/erase.
What problem does this paper attempt to address?