Focus on informative graphs! Semi-supervised active learning for graph-level classification
Wei Ju,Zhengyang Mao,Ziyue Qiao,Yifang Qin,Siyu Yi,Zhiping Xiao,Xiao Luo,Yanjie Fu,Ming Zhang
DOI: https://doi.org/10.1016/j.patcog.2024.110567
IF: 8
2024-05-07
Pattern Recognition
Abstract:Graph-level classification is a critical problem in social analysis and bioinformatics. Since annotated labels are typically costly, we intend to study this challenging task in semi-supervised scenarios with limited budgets. Inspired by the fact that active learning is capable of interactively querying an oracle to annotate a small proportion of informative samples in the unlabeled dataset, we present a novel S emi-su p ervised a ctive learning framework termed GraphSpa for graph-level classification. To make the most of labeling budgets, we develop an effective unlabeled data selection strategy that takes both local similarity and global semantic structure into account. Specifically, we first construct an adaptive queue with labeled samples and select informative samples that have a low degree of similarity to the queue using the Min-Max principle from the local view. Further, we introduce class prototypes and select samples with a large predictive loss discrepancy from the global view. To fully leverage the unlabeled data, we develop a semi-supervised active learning framework on the basis of our fusion selection strategy coupled with graph contrastive learning during active learning. Experimental results on various real-world benchmark datasets verify the efficacy of our GraphSpa against state-of-the-art methods.
computer science, artificial intelligence,engineering, electrical & electronic