SPIRONet: Spatial-Frequency Learning and Topological Channel Interaction Network for Vessel Segmentation

De-Xing Huang, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Zhen-Qiu Feng, Mei-Jiang Gui, Hao Li, Tian-Yu Xiang, Bo-Xian Yao, Zeng-Guang Hou
2024-06-28
Abstract: Automatic vessel segmentation is paramount for developing next-generation interventional navigation systems. However, current approaches suffer from suboptimal segmentation performances due to significant challenges in intraoperative images (i.e., low signal-to-noise ratio, small or slender vessels, and strong interference). In this paper, a novel spatial-frequency learning and topological channel interaction network (SPIRONet) is proposed to address the above issues. Specifically, dual encoders are utilized to comprehensively capture local spatial and global frequency vessel features. Then, a cross-attention fusion module is introduced to effectively fuse spatial and frequency features, thereby enhancing feature discriminability. Furthermore, a topological channel interaction module is designed to filter out task-irrelevant responses based on graph neural networks. Extensive experimental results on several challenging datasets (CADSA, CAXF, DCA1, and XCAD) demonstrate state-of-the-art performances of our method. Moreover, the inference speed of SPIRONet is 21 FPS with a 512x512 input size, surpassing clinical real-time requirements (6~12 FPS). These promising outcomes indicate SPIRONet's potential for integration into vascular interventional navigation systems. Code is available at https://github.com/Dxhuang-CASIA/SPIRONet.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper proposes SPIRONet (SPatial-frequency Learning and TopologIcal Channel InteRaction Network), a method for automatic vessel segmentation. Automatic vessel segmentation is crucial for interventional navigation systems, but existing methods perform poorly on intraoperative images, which have a low signal-to-noise ratio, small or slender vessels, and strong interference. SPIRONet uses dual encoders to capture local spatial features and global frequency features, and a cross-attention fusion module to fuse the two and enhance feature discriminability. A topological channel interaction module, based on graph neural networks, then filters out task-irrelevant responses. Experiments on four challenging datasets (CADSA, CAXF, DCA1, and XCAD) show state-of-the-art performance, and the inference speed of 21 FPS at a 512x512 input size exceeds clinical real-time requirements (6~12 FPS), indicating the method's potential for integration into vascular interventional navigation systems.
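
To make the idea of fusing spatial and frequency features with cross-attention more concrete, here is a minimal NumPy sketch. It is not the authors' implementation (see the linked repository for that). The FFT log-magnitude used as a stand-in for the frequency branch, the single attention head, the residual connection, and all names (`frequency_features`, `cross_attention_fuse`, `w_q`, `w_k`, `w_v`) are assumptions made purely for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def frequency_features(feat):
    """Toy stand-in for a frequency branch (assumption, not the paper's encoder).

    feat: array of shape (C, H, W). Returns the log-magnitude of the 2D FFT
    of each channel, so every output value depends on the whole input map.
    """
    spectrum = np.fft.fft2(feat, axes=(-2, -1))
    return np.log1p(np.abs(spectrum))


def cross_attention_fuse(spatial, freq, w_q, w_k, w_v):
    """Single-head cross-attention: spatial tokens query frequency tokens.

    spatial, freq: (C, H, W) feature maps.
    w_q, w_k: (C, D) projections; w_v: (C, C) projection.
    Returns the fused map (C, H, W) and the attention matrix (H*W, H*W).
    """
    c, h, w = spatial.shape
    s_tok = spatial.reshape(c, h * w).T  # (N, C), one token per pixel
    f_tok = freq.reshape(c, h * w).T     # (N, C)

    q = s_tok @ w_q                      # queries from the spatial branch
    k = f_tok @ w_k                      # keys from the frequency branch
    v = f_tok @ w_v                      # values from the frequency branch

    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]), axis=-1)
    fused = s_tok + attn @ v             # residual fusion (assumption)
    return fused.T.reshape(c, h, w), attn


# Usage with random data standing in for encoder outputs.
rng = np.random.default_rng(0)
C, H, W, D = 8, 16, 16, 4
spatial = rng.standard_normal((C, H, W))
freq = frequency_features(spatial)
w_q = rng.standard_normal((C, D)) * 0.1
w_k = rng.standard_normal((C, D)) * 0.1
w_v = rng.standard_normal((C, C)) * 0.1
fused, attn = cross_attention_fuse(spatial, freq, w_q, w_k, w_v)
print(fused.shape, attn.shape)
```

In this sketch each spatial location attends over all frequency tokens, which is one plausible way to let local features draw on global context. A real network would learn the projection matrices and typically use multiple heads and normalization layers.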