Balancing High-performance and Lightweight: HL-UNet for 3D Cardiac Medical Image Segmentation
Hai Zhou,Binbin Hu,Nengmin Yi,Qingtai Li,Daji Ergu,Fangyao Liu
DOI: https://doi.org/10.1016/j.acra.2024.06.008
IF: 5.482
2024-06-21
Academic Radiology
Abstract:Rationale and Objectives Cardiac magnetic resonance imaging is a crucial tool for analyzing, diagnosing, and formulating treatment plans for cardiovascular diseases. Currently, there is very little research focused on balancing cardiac segmentation performance with lightweight methods. Despite the existence of numerous efficient image segmentation algorithms, they primarily rely on complex and computationally intensive network models, making it challenging to implement them on resource-constrained medical devices. Furthermore, simplified models designed to meet the requirements of device lightweighting may have limitations in comprehending and utilizing both global and local information for cardiac segmentation. Materials and Methods We propose a novel 3D high-performance lightweight medical image segmentation network, HL-UNet, for application in cardiac image segmentation. Specifically, in HL-UNet, we propose a novel residual-enhanced Adaptive attention (REAA) module that combines residual-enhanced connectivity with an adaptive attention mechanism to efficiently capture key features of input images and optimize their representation capabilities, and integrates the Visual Mamba (VSS) module to enhance the performance of HL-UNet. Results Compared to large-scale models such as TransUNet, HL-UNet increased the Dice of the right ventricular cavity (RV), left ventricular myocardia (MYO), and left ventricular cavity (LV), the key indicators of cardiac image segmentation, by 1.61%, 5.03% and 0.19%, respectively. At the same time, the Params and FLOPs of the model decreased by 41.3 M and 31.05 G, respectively. Furthermore, compared to lightweight models such as the MISSFormer, the HL-UNet improves the Dice of RV, MYO, and LV by 4.11%, 3.82%, and 4.33%, respectively, when the number of parameters and computational complexity are close to or even lower. Conclusion The proposed HL-UNet model captures local details and edge information in images while being lightweight. Experimental results show that compared with large-scale models, HL-UNet significantly reduces the number of parameters and computational complexity while maintaining performance, thereby increasing frames per second (FPS). Compared to lightweight models, HL-UNet shows substantial improvements across various key metrics, with parameter count and computational complexity approaching or even lower.
radiology, nuclear medicine & medical imaging