A Cross-attention and Multi-level Feature Fusion Network for Breast Lesion Segmentation in Ultrasound Images

Guoqi Liu,Yanan Zhou,Jiajia Wang,Zongyu Chen,Dong Liu,Baofang Chang
DOI: https://doi.org/10.1109/tim.2024.3418104
IF: 5.6
2024-01-01
IEEE Transactions on Instrumentation and Measurement
Abstract:Lesion segmentation in breast ultrasound images (BUSIs) plays a crucial role in the early diagnosis of diseases. Convolutional neural networks (CNNs) have demonstrated remarkable performance in BUSI segmentation. However, they struggle with capturing long-term dependencies in the images, which can reduce the accuracy of lesion segmentation. We propose a cross-attention and multilevel feature fusion network (CMFF-Net) for breast lesion segmentation, a novel hybrid CNN-transformer architecture. First, we propose a cross-attention feature fusion (CAFF) module that combines the global context information from the transformer with the local spatial information extracted by the CNN, compensating for the lost semantic information for the decoding phase and enhancing the feature representations. Second, we design a feature extraction module (FEM) to refine the local features of CNN. The original features can effectively establish the dependencies between features at the current size latitude. Finally, to improve the network performance, we construct a multilevel feature aggregation (MFA) module that adaptively computes the weights of different feature layers to ensure smoother interaction and integration. Extensive experiments on three public breast ultrasound datasets demonstrate that the proposed CMFF-Net outperforms other state-of-the-art (SOTA) methods.
What problem does this paper attempt to address?