Vocal cord leukoplakia classification using deep learning models in white light and narrow band imaging endoscopy images
Zhenzhen You, Botao Han, Zhenghao Shi, Minghua Zhao, Shuangli Du, Jing Yan, Haiqin Liu, Xinhong Hei, Xiaoyong Ren, Yan Yan
DOI: https://doi.org/10.1002/hed.27543
2023-10-16
Head & Neck
Abstract:
Background: Accurate vocal cord leukoplakia classification is critical for the individualized treatment and early detection of laryngeal cancer. Numerous deep learning techniques have been proposed, but it remains unclear how to select one for laryngeal tasks. This article introduces and reliably evaluates existing deep learning models for vocal cord leukoplakia classification.
Methods: We created white light and narrow band imaging (NBI) image datasets of vocal cord leukoplakia, in which images were classified into six classes: normal tissues (NT), inflammatory keratosis (IK), mild dysplasia (MiD), moderate dysplasia (MoD), severe dysplasia (SD), and squamous cell carcinoma (SCC). Vocal cord leukoplakia classification was performed using six classical deep learning models: AlexNet, VGG, Google Inception, ResNet, DenseNet, and Vision Transformer.
Results: GoogLeNet (i.e., Google Inception V1), DenseNet-121, and ResNet-152 achieved excellent classification performance. The highest overall accuracy was 0.9583 for white light image classification and 0.9478 for NBI image classification. All three networks provided very high sensitivity, specificity, and precision values.
Conclusion: GoogLeNet, ResNet, and DenseNet can provide accurate pathological classification of vocal cord leukoplakia. Such classification facilitates early diagnosis, supports the decision between conservative treatment and surgical treatment of different degrees, and reduces the burden on endoscopists.
surgery, otorhinolaryngology
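
The abstract reports overall accuracy together with sensitivity, specificity, and precision for the six-class task. A minimal sketch of how such metrics are commonly derived from a multi-class confusion matrix is given below, using a one-vs-rest convention per class. This convention is an assumption, since the abstract does not state how the authors computed their values. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
# Six classes named in the abstract; the index order is an arbitrary choice.
CLASSES = ["NT", "IK", "MiD", "MoD", "SD", "SCC"]


def confusion_matrix(y_true, y_pred, n_classes):
    """Return cm where cm[t][p] counts samples of true class t predicted as p."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def overall_accuracy(cm):
    """Fraction of all samples that lie on the diagonal."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[k][k] for k in range(len(cm)))
    return correct / total if total else float("nan")


def per_class_metrics(cm):
    """One-vs-rest sensitivity, specificity, and precision for each class."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    metrics = {}
    for k in range(n):
        tp = cm[k][k]
        fn = sum(cm[k]) - tp                         # class k predicted as another class
        fp = sum(cm[i][k] for i in range(n)) - tp    # another class predicted as k
        tn = total - tp - fn - fp
        metrics[k] = {
            "sensitivity": tp / (tp + fn) if tp + fn else float("nan"),
            "specificity": tn / (tn + fp) if tn + fp else float("nan"),
            "precision": tp / (tp + fp) if tp + fp else float("nan"),
        }
    return metrics


# Toy labels, invented purely for illustration (not data from the study).
y_true = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4, 5, 5]
y_pred = [0, 0, 1, 2, 2, 2, 3, 3, 4, 5, 5, 5]

cm = confusion_matrix(y_true, y_pred, len(CLASSES))
print("overall accuracy:", round(overall_accuracy(cm), 4))
for k, m in per_class_metrics(cm).items():
    print(CLASSES[k], {name: round(value, 4) for name, value in m.items()})
```

With six classes, specificity tends to be high for every class because the true negatives pool all samples from the other five classes, so per-class sensitivity and precision are usually the more discriminating figures when comparing models.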