Transfer learning for anatomical structure segmentation in otorhinolaryngology microsurgery

Xin Ding,Yu Huang,Yang Zhao,Xu Tian,Guodong Feng,Zhiqiang Gao
DOI: https://doi.org/10.1002/rcs.2634
2024-05-22
International Journal of Medical Robotics and Computer Assisted Surgery
Abstract:Abstract Background Reducing the annotation burden is an active and meaningful area of artificial intelligence (AI) research. Methods Multiple datasets for the segmentation of two landmarks were constructed based on 41 257 labelled images and 6 different microsurgical scenarios. These datasets were trained using the multi‐stage transfer learning (TL) methodology. Results The multi‐stage TL enhanced segmentation performance over baseline (mIOU 0.6892 vs. 0.8869). Besides, Convolutional Neural Networks (CNNs) achieved a robust performance (mIOU 0.8917 vs. 0.8603) even when the training dataset size was reduced from 90% (30 078 images) to 10% (3342 images). When directly applying the weight from one certain surgical scenario to recognise the same target in images of other scenarios without training, CNNs still obtained an optimal mIOU of 0.6190 ± 0.0789. Conclusions Model performance can be improved with TL in datasets with reduced size and increased complexity. It is feasible for data‐based domain adaptation among different microsurgical fields.
surgery
What problem does this paper attempt to address?