CephalFormer: Incorporating Global Structure Constraint into Visual Features for General Cephalometric Landmark Detection
Yankai Jiang,Yiming Li,Xinyue Wang,Yubo Tao,Jun Lin,Hai Lin
DOI: https://doi.org/10.1007/978-3-031-16437-8_22
2022-01-01
Abstract:Accurate cephalometric landmark detection is a crucial step in orthodontic diagnosis and therapy planning However, existing deep learning-based methods lack the ability to explicitly model the complex dependencies among visual features and landmarks. Therefore, they fail to adaptively encode the landmark's global structure constraint into the representation of visual concepts and suffer from large biases in landmark localization. In this work, we propose CephalFormer, which exploits the correlations between visual concepts and landmarks to provide meaningful guidance for accurate 2D and 3D cephalometric landmark detection. CephalFormer explores local-global anatomical contents in a coarse-to-fine fashion and consists of two stages: (1) a new efficient Transformer-based architecture for coarse landmark localization; (2) a novel paradigm based on self-attention to represent visual clues and landmarks in one coherent feature space for fine-scale landmark detection. We evaluated CephalFormer on two public cephalometric landmark detection benchmarks and a real-patient dataset consisting of 150 skull CBCT volumes. Experiments show that CephalFormer significantly outperforms the state-of-the-art methods, demonstrating its generalization capability and stability to naturally handle both 2D and 3D scenarios under a unified framework.
What problem does this paper attempt to address?