SRSNetwork: Siamese Reconstruction-Segmentation Networks based on Dynamic-Parameter Convolution

Bingkun Nian,Fenghe Tang,Jianrui Ding,Pingping Zhang,Jie Yang,S.Kevin Zhou,Wei Liu
2023-12-04
Abstract:In this paper, we present a high-performance deep neural network for weak target image segmentation, including medical image segmentation and infrared image segmentation. To this end, this work analyzes the existing dynamic convolutions and proposes dynamic parameter convolution (DPConv). Furthermore, it reevaluates the relationship between reconstruction tasks and segmentation tasks from the perspective of DPConv, leading to the proposal of a dual-network model called the Siamese Reconstruction-Segmentation Network (SRSNet). The proposed model is not only a universal network but also enhances the segmentation performance without altering its structure, leveraging the reconstruction task. Additionally, as the amount of training data for the reconstruction network increases, the performance of the segmentation network also improves synchronously. On seven datasets including five medical datasets and two infrared image datasets, our SRSNet consistently achieves the best segmentation results. The code is released at <a class="link-external link-https" href="https://github.com/fidshu/SRSNet" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the issue of weak target image segmentation, particularly the challenges encountered in medical image segmentation and infrared image segmentation. The authors analyzed the existing dynamic convolution methods and proposed Dynamic Parameter Convolution (DPConv), a novel convolution operation that can generate adaptive dynamic parameter kernels based on the data distribution of input features. Building on DPConv, the paper further introduces a dual-network architecture named Siamese Reconstruction-Segmentation Network (SRSNet), which enhances segmentation performance by transforming the segmentation task into a reconstruction task, without altering the network structure. SRSNet is not only a versatile network but also improves the performance of the segmentation network by leveraging the reconstruction task, without the need for additional segmentation task training data. As the amount of training data for the reconstruction network increases, the performance of the segmentation network also improves synchronously. On seven datasets, including five medical datasets and two infrared image datasets, SRSNet achieved the best segmentation results. Experimental results show that SRSNet's segmentation performance on ultrasound tumor images, dermatoscopic images, and computed tomography (CT) images is superior to existing techniques, especially in handling weak target segmentation tasks, such as ultrasound and CT images, where its performance advantage is more significant. Furthermore, the paper also demonstrates SRSNet's exceptional performance in infrared image segmentation, particularly when dealing with small infrared targets with scarce internal features, low target pixel occupancy, and complex backgrounds. This indicates that SRSNet has broad application potential and superiority in the field of weak target segmentation.