RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation

Yazhou Zhu,Minxian Li,Qiaolin Ye,Shidong Wang,Tong Xin,Haofeng Zhang
2024-10-09
Abstract:Few-shot medical image segmentation (FSMIS) aims to perform the limited annotated data learning in the medical image analysis scope. Despite the progress has been achieved, current FSMIS models are all trained and deployed on the same data domain, as is not consistent with the clinical reality that medical imaging data is always across different data domains (e.g. imaging modalities, institutions and equipment sequences). How to enhance the FSMIS models to generalize well across the different specific medical imaging domains? In this paper, we focus on the matching mechanism of the few-shot semantic segmentation models and introduce an Earth Mover's Distance (EMD) calculation based domain robust matching mechanism for the cross-domain scenario. Specifically, we formulate the EMD transportation process between the foreground support-query features, the texture structure aware weights generation method, which proposes to perform the sobel based image gradient calculation over the nodes, is introduced in the EMD matching flow to restrain the domain relevant nodes. Besides, the point set level distance measurement metric is introduced to calculated the cost for the transportation from support set nodes to query set nodes. To evaluate the performance of our model, we conduct experiments on three scenarios (i.e., cross-modal, cross-sequence and cross-institution), which includes eight medical datasets and involves three body regions, and the results demonstrate that our model achieves the SoTA performance against the compared models.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to improve the generalization ability of the model between different data domains when performing cross - domain few - shot medical image segmentation (CD - FSMIS). Specifically, current few - shot medical image segmentation models are usually trained and deployed on the same data domain, which is inconsistent with the actual clinical situation because medical imaging data often come from different modalities, institutions and device sequences. Therefore, how to enhance these models so that they can better generalize to different specific medical imaging domains has become an important research topic. ### Background of the Paper Few - shot medical image segmentation (FSMIS) aims to learn within the scope of medical image analysis using limited labeled data. Although some progress has been made in this regard, existing FSMIS models are all trained and deployed on the same data domain, which is inconsistent with the situation in clinical reality where medical imaging data always span different data domains. Therefore, how to enhance the FSMIS model so that it performs well in different specific medical imaging domains is an important problem. ### Main Contributions 1. **Proposed a matching mechanism based on Earth Mover's Distance (EMD)**: This mechanism is used to calibrate foreground features to enhance the robustness of cross - domain matching. 2. **Introduced a texture - complexity - aware node weight generation method**: Weights are generated by calculating the gradient - signal complexity of node features. 3. **Developed a boundary - preserving point - set distance function**: As a node - transfer - cost function, it is used to calculate the transfer cost between support nodes and query nodes. 4. **Experimental verification in multiple cross - domain scenarios**: Including cross - modality, cross - sequence and cross - institution medical data sets, and the results show that the model achieves state - of - the - art performance in these scenarios. ### Method Overview 1. **Foreground feature generation**: Extract foreground features from support images and query images. 2. **EMD - based domain - robust matching mechanism**: Calculate the correlation score between support foreground features and query foreground features. 3. **Calibration and prediction**: Use the correlation score to transform features from domain - sensitive to domain - invariant and make the final segmentation prediction. ### Experimental Results The paper conducted experiments on three cross - domain scenarios (cross - modality, cross - sequence and cross - institution), involving six medical data sets and three body parts. The experimental results show that the proposed RobustEMD model outperforms other comparison models in these scenarios. ### Formula Representation - **Node weight generation**: \[ lvn_i=\frac{1}{HW}\sum_{x = 1}^{W}\sum_{y = 1}^{H}(\mu^2(x,y)-\mu(x,y)^2) \] \[ \mu^2(x,y)=\frac{1}{k^2}\sum_{i=-\frac{k}{2}}^{\frac{k}{2}}\sum_{j =-\frac{k}{2}}^{\frac{k}{2}}(g_n^i(x + i,y + j))^2 \] \[ \mu(x,y)=\frac{1}{k^2}\sum_{i=-\frac{k}{2}}^{\frac{k}{2}}\sum_{j =-\frac{k}{2}}^{\frac{k}{2}}g_n^i(x + i,y + j) \] \[ x_n^i=e^{-lvn_i} \] - **Cost function**: \[ c_n^{ij}=\max\{\sup_{x\in u_n^i}d(x,v_n^j),\sup_{y\in v_n^j}d(y,u_n^i)\} \] \[ d(p,A)=\inf_{p\in A}\text{dist}(p,y) \]