Uncertainty bidirectional guidance of multi-task mamba network for medical image classification and segmentation

Xingao Wu,Gang Gou
DOI: https://doi.org/10.1007/s11760-024-03633-z
IF: 1.583
2024-12-04
Signal Image and Video Processing
Abstract:Multi-task learning for joint medical image segmentation and classification holds promise for enhancing diagnostic accuracy and reliability in clinical settings. Current approaches often rely on unidirectionally bootstrapping one task with a single high-level feature from another, which fails to leverage valuable information fully and can lead to suboptimal outcomes and diagnostic errors. Multi-task learning frameworks based on Convolutional Neural Networks (CNNs) or Vision Transformers (ViTs) have achieved significant success in medical image analysis. However, CNNs struggle with long sequence information, and ViTs are computationally intensive. Based on this, in this paper, we propose a novel Uncertainty Bidirectional Guidance of multi-task Mamba network (UBGM) for efficient and reliable medical image analysis. UBGM's encoder utilizes a Mamba structure, excelling in remote modeling and maintaining computational efficiency with linear complexity. The uncertainty coarse segmentation guidance module performs interactive learning between tasks, generating multiple high-level features for classification and coarse segmentation results by incorporating uncertainty. To better utilize segmentation information, we designed an uncertainty classification decoder to produce category information and features for assisted segmentation correction. Real bidirectional guidance is achieved by the mutual assistance of both tasks, improving model performance. Experiments on public datasets demonstrate that UBGM outperforms existing benchmark models, showing potential for high performance and reliability in multi-task networks.
engineering, electrical & electronic,imaging science & photographic technology
What problem does this paper attempt to address?