SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification

Tong Shu, Jun Shi, Dongdong Sun, Zhiguo Jiang, Yushan Zheng
2024-07-19
Abstract: Existing whole slide image (WSI) analysis methods rest on the consensus that the histopathological characteristics of tumors provide significant guidance for cancer diagnostics. In particular, because the evolution of cancer is a continuous process, the correlations and differences across stages, anatomical locations, and patients should be taken into account. However, recent research mainly focuses on the inner-contextual information within a single WSI, ignoring the correlations between slides. To verify whether introducing slide inter-correlations can improve WSI representation learning, we propose a generic WSI analysis pipeline, SlideGCD, that takes existing multi-instance learning (MIL) methods as the backbone and recasts the WSI classification task as a node classification problem. More specifically, SlideGCD maintains a node buffer that stores previous slide embeddings for subsequent extensive slide-based graph construction and conducts graph learning to explore the inter-correlations implied in the slide-based graph. Moreover, we frame the MIL classifier and graph learning as two parallel workflows and deploy knowledge distillation to transfer the differentiable information to the graph neural network. Consistent performance gains from SlideGCD are observed for four previous state-of-the-art MIL methods on two TCGA benchmark datasets. The code is available at https://github.com/HFUT-miaLab/SlideGCD.
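The node buffer and slide-based graph construction described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the buffer's replacement policy or how edges are chosen, so the fixed-size FIFO buffer, the cosine-similarity k-nearest-neighbor rule, and all names below (`NodeBuffer`, `build_slide_graph`, `k`) are assumptions made for illustration, not the authors' implementation.

```python
import math
from collections import deque


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


class NodeBuffer:
    """Fixed-size FIFO store of earlier slide embeddings (assumed policy)."""

    def __init__(self, capacity):
        # deque(maxlen=...) silently drops the oldest entry when full.
        self.items = deque(maxlen=capacity)

    def extend(self, embeddings):
        self.items.extend(embeddings)


def build_slide_graph(batch, buffer, k):
    """Link every node to its k most similar other nodes (assumed edge rule).

    Nodes are the buffered embeddings followed by the current batch.
    Returns (nodes, edges), where edges is a set of directed (src, dst) pairs.
    """
    nodes = list(buffer.items) + list(batch)
    edges = set()
    for i, u in enumerate(nodes):
        scored = [(cosine(u, v), j) for j, v in enumerate(nodes) if j != i]
        scored.sort(key=lambda t: (-t[0], t[1]))  # most similar first
        for _, j in scored[:k]:
            edges.add((i, j))
    return nodes, edges


buffer = NodeBuffer(capacity=4)
buffer.extend([[1.0, 0.0], [0.9, 0.1]])  # toy embeddings from earlier slides
batch = [[0.0, 1.0], [0.1, 0.9]]         # toy embeddings from the current batch
nodes, edges = build_slide_graph(batch, buffer, k=1)
buffer.extend(batch)                     # keep the new embeddings for later batches
```

In this toy run the two buffered slides link to each other and the two new slides link to each other, which is the kind of inter-slide structure a graph neural network would then operate on.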
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to improve Whole Slide Image (WSI) classification by exploiting the correlations between slides. Existing WSI analysis methods mainly focus on the contextual information within a single WSI and ignore how different slides relate to one another. The paper proposes a general WSI analysis framework called SlideGCD, which uses existing Multi-Instance Learning (MIL) methods as the backbone and recasts WSI classification as a node classification problem. SlideGCD builds a slide-based graph and applies graph learning to explore the implicit correlations between slides; the abstract reports consistent gains for four previous state-of-the-art MIL methods on two TCGA benchmark datasets. The main contributions of SlideGCD include:
1. Proposing a general pathological WSI analysis pipeline, SlideGCD, which can adapt to any existing MIL method.
2. Adopting a rehearsal-based graph construction strategy to describe the interrelationships between slides.
3. Utilizing knowledge distillation for collaborative training to fully leverage the knowledge learned by the MIL classifier.
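The knowledge distillation step can be sketched as a loss that pulls the graph branch's predictions toward those of the MIL classifier. The text above does not state the exact loss, so the temperature-softened KL divergence used here (the common Hinton-style formulation) is an assumption, as are the names `distillation_loss`, `mil_logits`, and `gnn_logits` and the temperature value.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits / temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    lse = m + math.log(sum(math.exp(z - m) for z in scaled))
    return [z - lse for z in scaled]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Assumed roles: the MIL classifier is the teacher and the graph neural
    network is the student.
    """
    log_p = log_softmax(teacher_logits, temperature)
    log_q = log_softmax(student_logits, temperature)
    kl = sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    return kl * temperature ** 2


mil_logits = [2.0, 0.5]  # hypothetical MIL classifier output for one slide
gnn_logits = [1.0, 1.0]  # hypothetical GNN output for the same slide node
loss = distillation_loss(mil_logits, gnn_logits)
```

In a collaborative training setup, a term like this would typically be added to the graph branch's ordinary classification loss; the loss is zero when both branches produce the same softened distribution and grows as they diverge.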