MAFNet: A deep multi-scale attentive fusion network for virtual osteotomy of maxillofacial bones in CT images containing metal artifacts

Su Yang,Ji-Yong Yoo,Sang-Jeong Lee,Se-Ryong Kang,Jun-Min Kim,Jo-Eun Kim,Kyung-Hoe Huh,Sam-Sun Lee,Min-Suk Heo,Hoon Joo Yang,Won-Jin Yi
DOI: https://doi.org/10.1016/j.bspc.2024.106411
IF: 5.1
2024-05-19
Biomedical Signal Processing and Control
Abstract:An essential step for 3D virtual surgical planning in orthognathic surgery is image segmentation to generate virtual 3D models for virtual osteotomy of the maxillofacial bone in CT images. However, this manual segmentation process is time-consuming and labor-intensive and requires expertise. Also, most conventional automatic segmentation methods can not directly segment the maxillofacial bone for virtual Le Fort I osteotomy. The purpose of this study was to automatically and robustly segment the maxillofacial bone for virtual Le Fort I osteotomy in CT images using a deep multi-scale attentive fusion network (MAFNet). MAFNet consisted of multi-scale encoders (MEs), an atrous feature fusion module (AFFM), a multi-scale spatial attention module (MSAM), and a weighted soft combo loss (SCL) with deep supervision. In performance comparisons, MAFNet outperformed popular segmentation networks by achieving 0.836, 0.951, and 0.929 in terms of F1-score for the maxilla, mandible, and skull segmentation, respectively. In ablation studies, the proposed components and SCL improved segmentation performance by allowing the network to learn multi-scale feature representations and anatomical contexts efficiently. MEs and AFFM captured multi-scale context features and mitigated the loss of spatial information. MSAM emphasized the boundary details, such as an osteotomy line between the maxilla and skull, and suppressed irrelevant background details, such as metal artifacts and soft tissues. SCL with deep supervision helped MAFNet alleviate the class imbalance problem and improve segmentation performance. Therefore, MAFNet outperformed popular segmentation networks and obtained robust segmentation results on challenging CT images including osteotomy lines, metal artifacts, and malocclusions in CT images.
engineering, biomedical
What problem does this paper attempt to address?