Abstract:Accurate and automatic segmentation of medical images is a key step in clinical diagnosis and analysis. Currently, the successful application of Transformers' model in the field of computer vision, researchers have begun to gradually explore the application of Transformers in medical segmentation of images, especially in combination with convolutional neural networks with coding-decoding structure, which have achieved remarkable results in the field of medical segmentation. However, most studies have combined Transformers with CNNs at a single scale or processed only the highest-level semantic feature information, ignoring the rich location information in the lower-level semantic feature information. At the same time, for problems such as blurred structural boundaries and heterogeneous textures in images, most existing methods usually simply connect contour information to capture the boundaries of the target. However, these methods cannot capture the precise outline of the target and ignore the potential relationship between the boundary and the region. In this paper, we propose the TGDAUNet, which consists of a dual-branch backbone network of CNNs and Transformers and a parallel attention mechanism, to achieve accurate segmentation of lesions in medical images. Firstly, high-level semantic feature information of the CNN backbone branches is fused at multiple scales, and the high-level and low-level feature information complement each other's location and spatial information. We further use the polarised self-attentive (PSA) module to reduce the impact of redundant information caused by multiple scales, to better couple with the feature information extracted from the Transformers backbone branch, and to establish global contextual long-range dependencies at multiple scales. In addition, we have designed the Reverse Graph-reasoned Fusion (RGF) module and the Feature Aggregation (FA) module to jointly guide the global context. The FA module aggregates high-level semantic feature information to generate an original global predictive segmentation map. The RGF module captures non-significant features of the boundaries in the original or secondary global prediction segmentation graph through a reverse attention mechanism, establishing a graph reasoning module to explore the potential semantic relationships between boundaries and regions, further refining the target boundaries. Finally, to validate the effectiveness of our proposed method, we compare our proposed method with the current popular methods in the CVC-ClinicDB, Kvasir-SEG, ETIS, CVC-ColonDB, CVC-300,datasets as well as the skin cancer segmentation datasets ISIC-2016 and ISIC-2017. The large number of experimental results show that our method outperforms the currently popular methods. Source code is released at https://github.com/sd-spf/TGDAUNet.

DMSA-UNet: Dual Multi-Scale Attention makes UNet more strong for medical image segmentation

Mixed Transformer U-Net for Medical Image Segmentation

MDA-Unet: A Multi-Scale Dilated Attention U-Net for Medical Image Segmentation

MSCA-UNet: multi-scale channel attention-based UNet for segmentation of medical ultrasound images

MSR-UNet: enhancing multi-scale and long-range dependencies in medical image segmentation

UNet based on dynamic convolution decomposition and triplet attention

DA-TransUNet: Integrating Spatial and Channel Dual Attention with Transformer U-Net for Medical Image Segmentation

Problems in the interpretation of serological results of hepatitis B testing during an incident of hepatitis B virus reactivation in a dialysis unit.

A multi-attention and depthwise separable convolution network for medical image segmentation

MA-Unet: An improved version of Unet based on multi-scale and attention mechanism for medical image segmentation

Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation

TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation

MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation

DmADs-Net: Dense multiscale attention and depth-supervised network for medical image segmentation

DW-SCA Unet: medical image segmentation based on depth-wise separable convolutional attention U-shaped network

MSRD-Unet: Multiscale Residual Dilated U-Net for Medical Image Segmentation

MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation

A Multi-Scale Context Aware Attention Model for Medical Image Segmentation

MS-UNet: Multi-Scale Nested UNet for Medical Image Segmentation with Few Training Data Based on an ELoss and Adaptive Denoising Method

MD-UNet: a medical image segmentation network based on mixed depthwise convolution

Sequential grafting of the right gastroepiploic artery in coronary artery bypass surgery.