Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment

Tianyi Liu,Zhaorui Tan,Haochuan Jiang,Xi Yang,Kaizhu Huang
2024-09-28
Abstract:Brain tumor segmentation is often based on multiple magnetic resonance imaging (MRI). However, in clinical practice, certain modalities of MRI may be missing, which presents an even more difficult scenario. To cope with this challenge, knowledge distillation has emerged as one promising strategy. However, recent efforts typically overlook the modality gaps and thus fail to learn invariant feature representations across different modalities. Such drawback consequently leads to limited performance for both teachers and students. To ameliorate these problems, in this paper, we propose a novel paradigm that aligns latent features of involved modalities to a well-defined distribution anchor. As a major contribution, we prove that our novel training paradigm ensures a tight evidence lower bound, thus theoretically certifying its effectiveness. Extensive experiments on different backbones validate that the proposed paradigm can enable invariant feature representations and produce a teacher with narrowed modality gaps. This further offers superior guidance for missing modality students, achieving an average improvement of 1.75 on dice score.
Image and Video Processing,Artificial Intelligence,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to improve the accuracy of brain tumor segmentation in clinical practice when certain MRI modalities may be missing due to data corruption or changes in scanning protocols. Specifically, when some modalities in multi - modal MRI are missing, the existing knowledge distillation methods usually ignore the differences between modalities (i.e., modality gap), which causes the teacher model to be unable to learn invariant feature representations, thus limiting the performance of the student model. To solve this problem, the authors propose a new alignment paradigm. By aligning the latent features of different modalities to a predefined distribution anchor point (called \( P_{\text{mix}} \)), the modality gap is reduced and cross - modal invariant feature learning is promoted. This method not only improves the learning effect of the teacher model but also provides better guidance for the student model dealing with missing modalities. ### Main contributions 1. **Propose a new alignment paradigm**: By introducing the latent space distribution \( P_{\text{mix}} \) as an alignment anchor point, learn cross - modal invariant features. 2. **Theoretical support**: It is proved that aligning each modality separately to the optimal \( P_{\text{mix}} \) can ensure a tighter evidence lower bound (ELBO), which is better than mapping all modalities as a whole to \( P_{\text{mix}} \). 3. **Experimental verification**: Through extensive experiments, the improvement of this paradigm on the performance of brain tumor segmentation in the latest SOTA backbone networks is verified, especially in the case of dealing with missing modalities. ### Key points of the solution - **Reduction of modality gap**: By aligning the latent features of different modalities to \( P_{\text{mix}} \), the differences between modalities are reduced and the learning of shared features is promoted. - **Improved knowledge distillation**: By providing better guidance from the aligned teacher model, the performance of the student model in dealing with missing modalities is improved. - **Optimized \( P_{\text{mix}} \)**: By weighted combination of the distributions of each modality, the optimal \( P_{\text{mix}} \) is found, which further improves the generalization ability of the model. ### Experimental results The experimental results show that the teacher model using \( P_{\text{mix}} \) as an alignment anchor point can significantly improve the Dice score of the student model, with an average increase of 1.75 points, especially in the case of dealing with the missing of three modalities. Through these improvements, this research provides an effective solution for solving the problem of missing modalities in multi - modal brain tumor segmentation.