Multi‐Instance Learning for Vocal Fold Leukoplakia Diagnosis Using White Light and Narrow‐Band Imaging: A Multicenter Study

Cheng‐Wei Tie, De‐Yang Li, Ji‐Qing Zhu, Mei‐Ling Wang, Jian‐Hui Wang, Bing‐Hong Chen, Ying Li, Sen Zhang, Lin Liu, Li Guo, Long Yang, Li‐Qun Yang, Jiao Wei, Feng Jiang, Zhi‐Qiang Zhao, Gui‐Qi Wang, Wei Zhang, Quan‐Mao Zhang, Xiao‐Guang Ni
DOI: https://doi.org/10.1002/lary.31537
IF: 2.97
2024-05-28
The Laryngoscope
Abstract: In our study, we trained a multi‐instance learning (MIL)‐based artificial intelligence (AI) model on multicenter white light imaging (WLI) and narrow‐band imaging (NBI) images. The model is intended to help distinguish benign from malignant vocal fold leukoplakia (VFL).

Objectives: VFL is a precancerous lesion of the larynx, and its endoscopic diagnosis is challenging. We aimed to develop an AI model that uses WLI and NBI to distinguish benign from malignant VFL.

Methods: A total of 7057 images from 426 patients were used for model development and internal validation. An additional 1617 images from two other hospitals were used for external validation. Models for the WLI and NBI modalities were trained using deep learning combined with a MIL approach. Furthermore, 50 prospectively collected videos were used to evaluate real‐time model performance. A human‐machine comparison involving 100 patients and 12 laryngologists assessed the real‐world effectiveness of the model.

Results: The model achieved its highest area under the receiver operating characteristic curve (AUC) values of 0.868 and 0.884 in the internal and external validation sets, respectively. The AUC in the video validation set was 0.825 (95% CI: 0.704–0.946). In the human‐machine comparison, AI significantly improved AUC and accuracy for all laryngologists (p
medicine, research & experimental; otorhinolaryngology
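
The abstract describes a MIL approach evaluated by AUC but does not say how image-level outputs are combined. As a minimal sketch only, the following assumes that each patient is a "bag" of images, that a classifier has already produced one malignancy score per image, and that scores are pooled per patient by max or mean. The function names, the pooling choice, and the toy data are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from statistics import mean


def aggregate_bags(instance_scores, pooling="max"):
    """Pool per-image scores into one score per patient (bag).

    instance_scores: iterable of (patient_id, score) pairs.
    pooling: "max" or "mean" (an assumed choice; the paper's is not stated).
    """
    bags = defaultdict(list)
    for patient_id, score in instance_scores:
        bags[patient_id].append(score)
    pool = max if pooling == "max" else mean
    return {pid: pool(scores) for pid, scores in bags.items()}


def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: three patients with two images each.
instance_scores = [
    ("A", 0.9), ("A", 0.7),
    ("B", 0.2), ("B", 0.4),
    ("C", 0.6), ("C", 0.1),
]
patient_labels = {"A": 1, "B": 0, "C": 0}  # 1 = malignant, 0 = benign

patient_scores = aggregate_bags(instance_scores, pooling="max")
ids = sorted(patient_scores)
patient_auc = auc([patient_labels[i] for i in ids],
                  [patient_scores[i] for i in ids])
print(patient_scores, patient_auc)
```

Max pooling flags a patient if any single image looks suspicious, while mean pooling is less sensitive to one outlier frame. Which behaviour is preferable depends on the clinical setting, and the study may have used a different, learned aggregation.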