Anchor Ball Regression Model for large-scale 3D skull landmark detection

Tao He,Guikun Xu,Li Cui,Wei Tang,Jie Long,Jixiang Guo
DOI: https://doi.org/10.1016/j.neucom.2023.127051
IF: 6
2024-01-01
Neurocomputing
Abstract:Recent deep learning models have exhibited impressive performance in the area of 3D skull landmark detection, but most of them aimed to detect a fixed number of landmarks. This paper focuses on automatically detecting an arbitrary number of landmarks on CT volumes, which meets the real clinical needs. To achieve robust performance for detecting arbitrary molar landmarks, we propose a novel 3D landmark detection model named Anchor Ball Regression Model (ABRM), which combines landmark detection and landmark classification losses for network training. For landmark detection, a novel landmark regression loss is proposed by predicting offsets to anchor balls instead of directly predicting landmarks. For landmark classification, an online hard negative mining loss is used to reduce absent landmarks’ learning errors, and a small regularization constraint loss is performed for voxels outside the anchor balls. The network backbone of ABRM is obtained by manually pruning popular 3D-CNNs. We also present an available large-scale benchmark dataset in this paper, which, to the best of our knowledge, is the largest dataset for 3D skull landmark detection. The dataset comprises of 658 CT volumes, with 14 landmarks labeled by two junior and one senior doctors. The ABRM presents a good robust performance and outperforms other models on this dataset. The codes and dataset are accessible at https://github.com/ithet1007/mmld_code.
computer science, artificial intelligence
What problem does this paper attempt to address?