Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

Gexin Huang,Chenfei Wu,Mingjie Li,Xiaojun Chang,Ling Chen,Ying Sun,Shen Zhao,Xiaodan Liang,Liang Lin
2024-06-05
Abstract:Predicting genetic mutations from whole slide images is indispensable for cancer diagnosis. However, existing work training multiple binary classification models faces two challenges: (a) Training multiple binary classifiers is inefficient and would inevitably lead to a class imbalance problem. (b) The biological relationships among genes are overlooked, which limits the prediction performance. To tackle these challenges, we innovatively design a Biological-knowledge enhanced PathGenomic multi-label Transformer to improve genetic mutation prediction performances. BPGT first establishes a novel gene encoder that constructs gene priors by two carefully designed modules: (a) A gene graph whose node features are the genes' linguistic descriptions and the cancer phenotype, with edges modeled by genes' pathway associations and mutation consistencies. (b) A knowledge association module that fuses linguistic and biomedical knowledge into gene priors by transformer-based graph representation learning, capturing the intrinsic relationships between different genes' mutations. BPGT then designs a label decoder that finally performs genetic mutation prediction by two tailored modules: (a) A modality fusion module that firstly fuses the gene priors with critical regions in WSIs and obtains gene-wise mutation logits. (b) A comparative multi-label loss that emphasizes the inherent comparisons among mutation status to enhance the discrimination capabilities. Sufficient experiments on The Cancer Genome Atlas benchmark demonstrate that BPGT outperforms the state-of-the-art.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the problem of predicting gene mutations from Whole Slide Images (WSIs), which is of significant importance for cancer diagnosis. However, existing methods that train multiple binary classification models to predict gene mutations face two major challenges: 1. **Inefficiency and class imbalance**: Training multiple binary classifiers is not only inefficient but also inevitably leads to class imbalance issues. 2. **Ignoring biological relationships between genes**: Existing methods overlook the biological relationships between different genes, limiting the prediction performance. To tackle these challenges, the authors designed a biological-knowledge enhanced multi-label classifier—Biological-knowledge enhanced PathGenomic multi-label Transformer (BPGT), to improve gene mutation prediction performance. The main innovations of BPGT include: - **Constructing a novel gene encoder**: By using two carefully designed modules (gene graph and knowledge association module), the gene prior information is constructed, integrating language descriptions and biomedical knowledge to capture the intrinsic relationships between different gene mutations. - **Designing a label decoder**: Through a modality fusion module, the gene prior information is fused with key regions in the WSI, and a comparative multi-label loss function is employed to emphasize the inherent comparisons between mutation states, enhancing the discriminative ability. Experimental results show that BPGT outperforms existing state-of-the-art methods on The Cancer Genome Atlas benchmark dataset.