A hybrid approach for improving U-Net variants in medical image segmentation

Aitik Gupta, Dr. Joydip Dhar
2023-07-31
Abstract: Medical image segmentation is vital to medical imaging because it enables professionals to examine and understand more accurately the information offered by different imaging modalities. It is the technique of splitting a medical image into segments or regions of interest. The resulting segmented images can be used for many purposes, including diagnosis, surgery planning, and therapy evaluation. In the initial phase of this research, the main focus was on reviewing existing deep-learning approaches, including MultiResUNet, Attention U-Net, the classical U-Net, and other variants. Most of these variants use attention feature vectors or maps, which dynamically assign larger weights to critical information, to increase accuracy, but their network parameter requirements are somewhat more demanding. They face problems such as overfitting, because their number of trainable parameters is very high, and so is their inference time. Therefore, the aim of this research is to reduce the network parameter requirements using depthwise separable convolutions, while maintaining performance on medical image segmentation tasks such as skin lesion segmentation by using an attention system and residual connections.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the parameter cost of current deep learning methods for medical image segmentation. Its goal is to reduce the number of trainable parameters in the network while maintaining performance. Existing U-Net variants (such as MultiResUNet and Attention U-Net) have improved accuracy, but they usually require a large number of trainable parameters, which leads to overfitting and longer inference times. To address this, the authors propose a hybrid approach that uses depthwise separable convolutions to reduce the number of network parameters, while relying on an attention mechanism and residual connections to maintain segmentation performance. The approach targets tasks such as skin lesion segmentation. The intended result is a less complex model that is less prone to overfitting and faster at inference.
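To make the parameter argument concrete, the following is a minimal sketch that compares the weight count of a standard convolution with that of a depthwise separable convolution (a per-channel k x k depthwise filter followed by a 1 x 1 pointwise convolution). The function names and the layer sizes (3 x 3 kernel, 64 input channels, 128 output channels) are illustrative assumptions, not values taken from the paper.

```python
def conv_params(k, c_in, c_out, bias=True):
    """Trainable parameters of a standard k x k convolution."""
    return k * k * c_in * c_out + (c_out if bias else 0)


def depthwise_separable_params(k, c_in, c_out, bias=True):
    """Trainable parameters of a depthwise k x k conv plus a 1 x 1 pointwise conv."""
    # Depthwise step: one k x k filter per input channel.
    depthwise = k * k * c_in + (c_in if bias else 0)
    # Pointwise step: a 1 x 1 convolution that mixes channels.
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


# Hypothetical layer sizes chosen only for illustration.
standard = conv_params(3, 64, 128, bias=False)
separable = depthwise_separable_params(3, 64, 128, bias=False)
print(standard, separable, round(separable / standard, 3))
# prints: 73728 8768 0.119
```

Without biases, the ratio equals 1/c_out + 1/k^2, so for 3 x 3 kernels the separable layer needs roughly one ninth of the weights once the number of output channels is large. How much of that saving carries over to a full network depends on which layers are replaced, which this sketch does not model.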