SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation

Lin Zhang,Wenbo Gao,Jie Yi,Yunyun Yang
2024-07-14
Abstract:Multi-organ segmentation in medical image analysis is crucial for diagnosis and treatment planning. However, many factors complicate the task, including variability in different target categories and interference from complex backgrounds. In this paper, we utilize the knowledge of Deformable Convolution V3 (DCNv3) and multi-object segmentation to optimize our Spatially Adaptive Convolution Network (SACNet) in three aspects: feature extraction, model architecture, and loss constraint, simultaneously enhancing the perception of different segmentation targets. Firstly, we propose the Adaptive Receptive Field Module (ARFM), which combines DCNv3 with a series of customized block-level and architecture-level designs similar to transformers. This module can capture the unique features of different organs by adaptively adjusting the receptive field according to various targets. Secondly, we utilize ARFM as building blocks to construct the encoder-decoder of SACNet and partially share parameters between the encoder and decoder, making the network wider rather than deeper. This design achieves a shared lightweight decoder and a more parameter-efficient and effective framework. Lastly, we propose a novel continuity dynamic adjustment loss function, based on t-vMF dice loss and cross-entropy loss, to better balance easy and complex classes in segmentation. Experiments on 3D slice datasets from ACDC and Synapse demonstrate that SACNet delivers superior segmentation performance in multi-organ segmentation tasks compared to several existing methods.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to address several key challenges in multi - organ medical image segmentation: 1. **Variability of different target classes**: Different organs in the same dataset vary significantly in size, shape, and texture, and the same organ also has large differences in morphology in different slices. This variability makes it difficult for the segmentation model to generalize effectively, and it is prone to over - fitting the specific features seen in training and performing poorly on unseen morphological changes. 2. **Interference from complex backgrounds**: The complexity of surrounding anatomical structures, different textures, intensities, and the presence of overlapping tissues introduce a great deal of difficulties. These complex backgrounds may lead to blurred boundaries, making it difficult to accurately identify and segment target organs, resulting in incomplete or inaccurate segmentation results. To meet these challenges, the paper proposes a new framework named SACNet (Spatially Adaptive Convolution Network), which contains three main innovations: 1. **Adaptive Receptive Field Module (ARFM)**: Combining Deformable Convolution V3 (DCNv3) and a series of Transformer - like custom block - level and architecture - level designs, ARFM can adaptively adjust the receptive field according to different segmentation targets and capture the unique features of different organs. 2. **Widenet strategy**: By sharing the projection weights of DCNv3 blocks between the encoder and the decoder, SACNet expands in network width instead of deepening the network depth, achieving a shared lightweight decoder and improving parameter efficiency and model performance. 3. **Continuous - time Dynamic Adjustment of Loss Function (CTLoss)**: Based on t - vMF Dice loss and cross - entropy loss, CTLoss adaptively adjusts the loss function according to the intersection - over - union (IOU) of each segmentation class on the validation set after each epoch, better balancing the segmentation of easy and difficult classes. The paper verifies the effectiveness of SACNet through experiments on two 3D slice datasets, ACDC and Synapse. The results show that SACNet has higher segmentation performance in multi - organ segmentation tasks compared to existing methods.