Multi-View Vertebra Localization and Identification from CT Images

Han Wu,Jiadong Zhang,Yu Fang,Zhentao Liu,Nizhuan Wang,Zhiming Cui,Dinggang Shen
2023-07-24
Abstract:Accurately localizing and identifying vertebrae from CT images is crucial for various clinical applications. However, most existing efforts are performed on 3D with cropping patch operation, suffering from the large computation costs and limited global information. In this paper, we propose a multi-view vertebra localization and identification from CT images, converting the 3D problem into a 2D localization and identification task on different views. Without the limitation of the 3D cropped patch, our method can learn the multi-view global information naturally. Moreover, to better capture the anatomical structure information from different view perspectives, a multi-view contrastive learning strategy is developed to pre-train the backbone. Additionally, we further propose a Sequence Loss to maintain the sequential structure embedded along the vertebrae. Evaluation results demonstrate that, with only two 2D networks, our method can localize and identify vertebrae in CT images accurately, and outperforms the state-of-the-art methods consistently. Our code is available at <a class="link-external link-https" href="https://github.com/ShanghaiTech-IMPACT/Multi-View-Vertebra-Localization-and-Identification-from-CT-Images" rel="external noopener nofollow">this https URL</a>.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve The paper aims to address the problem of accurately locating and identifying vertebrae from CT images. In clinical applications such as surgical planning, pathological diagnosis, and postoperative evaluation, accurate localization and identification of vertebrae are crucial. However, most existing methods operate in 3D space and handle the task through patch cropping operations, which result in significant computational costs and limited global information acquisition. To overcome these issues, the authors propose a multi-view vertebrae localization and identification method that transforms the 3D problem into 2D localization and identification tasks on different views. This approach not only avoids the limitations of 3D patch cropping but also naturally learns global information from multiple views. Additionally, the authors develop a multi-view contrastive learning strategy to pre-train the backbone network and introduce a Sequence Loss to maintain the sequential structure of vertebrae along the spine. Experimental results demonstrate that this method can accurately locate and identify vertebrae in CT images using only two 2D networks and performs excellently on multiple benchmarks.