Abstract: A custom‐made algorithm called SegMENT‐Plus was trained on 3933 laryngeal carcinoma images obtained from 557 patients. The model achieved a Dice similarity coefficient of 0.827, an Intersection over Union of 0.828, an accuracy of 0.972, and an inference speed of 25.6 fps, thus reaching real‐time performance. SegMENT‐Plus performed similarly on two external validation datasets. The performance of the model showed no significant differences from that of two residents. The implementation of artificial intelligence during laryngoscopy can support clinicians in delineating the superficial extent of laryngeal cancer. SegMENT‐Plus showed reliable results, with performance equal to that of two otolaryngology residents and a computation speed compatible with real‐time use.

Objective: To investigate the potential of deep learning for automatically delineating (segmenting) the superficial extent of laryngeal cancer on endoscopic images and videos.

Methods: A retrospective study was conducted, extracting and annotating white light (WL) and Narrow‐Band Imaging (NBI) frames to train a segmentation model (SegMENT‐Plus). Two external datasets were used for validation. The model's performance was compared with that of two otolaryngology residents. In addition, the model was tested on real intraoperative laryngoscopy videos.

Results: A total of 3933 images of laryngeal cancer from 557 patients were used. The model achieved the following median values (interquartile range): Dice Similarity Coefficient (DSC) = 0.83 (0.70–0.90), Intersection over Union (IoU) = 0.83 (0.73–0.90), Accuracy = 0.97 (0.95–0.99), Inference Speed = 25.6 (25.1–26.1) frames per second. The external testing cohorts comprised 156 and 200 images. SegMENT‐Plus performed similarly on all three datasets for DSC (p = 0.05) and IoU (p = 0.07). No significant differences were observed when separately analyzing WL and NBI test images for DSC (p = 0.06) and IoU (p = 0.78), or when comparing the model with the two residents for DSC (p = 0.06) and IoU (Senior vs. SegMENT‐Plus, p = 0.13; Junior vs. SegMENT‐Plus, p = 1.00). The model was then tested on real intraoperative laryngoscopy videos.

Conclusion: SegMENT‐Plus can accurately delineate laryngeal cancer boundaries in endoscopic images, with performance equal to that of two otolaryngology residents. The results on the two external datasets demonstrate excellent generalization capabilities. The computation speed of the model allowed its application to videolaryngoscopies simulating real‐time use. Clinical trials are needed to evaluate the role of this technology in surgical practice and resection margin improvement.

Level of Evidence: III. Laryngoscope, 2024
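For readers unfamiliar with the overlap metrics reported above, the following is a minimal sketch of how DSC, IoU, and pixel accuracy are commonly computed for one predicted binary mask against its annotated ground truth. It is not the authors' implementation: the function names, the nested-list mask representation, and the toy masks are assumptions made for illustration only, and the abstract does not state how empty masks were handled.

```python
def confusion_counts(pred, truth):
    """Count TP, FP, FN, TN over two same-shaped binary masks (nested lists of 0/1)."""
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def segmentation_metrics(pred, truth):
    """Return DSC, IoU and pixel accuracy for one image (hypothetical helper)."""
    tp, fp, fn, tn = confusion_counts(pred, truth)
    total = tp + fp + fn + tn
    if total == 0:
        raise ValueError("masks must contain at least one pixel")
    union = tp + fp + fn
    # Assumption: two empty masks count as perfect agreement.
    dsc = 2 * tp / (2 * tp + fp + fn) if union else 1.0
    iou = tp / union if union else 1.0
    accuracy = (tp + tn) / total
    return {"dsc": dsc, "iou": iou, "accuracy": accuracy}


# Toy 3x4 masks (illustrative only, not study data).
pred = [[1, 1, 0, 0],
        [1, 1, 0, 0],
        [0, 0, 0, 0]]
truth = [[1, 1, 1, 0],
         [1, 1, 0, 0],
         [0, 0, 0, 0]]

metrics = segmentation_metrics(pred, truth)
print(metrics)
```

Per-image values computed this way would then be summarized across a test set, for example as a median with interquartile range, which is the form in which the abstract reports its results.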

Predicting semantic segmentation quality in laryngeal endoscopy images

Semi-Supervised Learning for Semantic Segmentation of Emphysema With Partial Annotations

A Dataset of Laryngeal Endoscopic Images with Comparative Study on Convolution Neural Network Based Semantic Segmentation

Deep Learning-Based Detection of Glottis Segmentation Failures

LapSeg3D: Weakly Supervised Semantic Segmentation of Point Clouds Representing Laparoscopic Scenes

An automated approach for real-time informative frames classification in laryngeal endoscopy using deep learning

Experimental Framework for Generating Reliable Ground Truth for Laryngeal Spatial Segmentation Tasks

Development of deep learning segmentation models for coronary X-ray angiography: Quality assessment by a new global segmentation score and comparison with human performance

Real-time Prediction of Segmentation Quality

Using Machine Learning for Endoscopic Detection of Low-Grade Subglottic Stenosis: A Proof of Principle

Laryngeal Image Dataset Automatic Annotation and Classification of Laryngeal Disease

Redefining the Laparoscopic Spatial Sense: AI-based Intra- and Postoperative Measurement from Stereoimages

How can we learn (more) from challenges? A statistical approach to driving future algorithm development

Quality Control-Driven Image Segmentation Towards Reliable Automatic Image Analysis in Large-Scale Cardiovascular Magnetic Resonance Aortic Cine Imaging

Vessel and tissue recognition during third-space endoscopy using a deep learning algorithm

Real‐Time Laryngeal Cancer Boundaries Delineation on White Light and Narrow‐Band Imaging Laryngoscopy with Deep Learning

Deep learning for real-time multi-class segmentation of artefacts in lung ultrasound

Segmentation quality assessment by automated detection of erroneous surface regions in medical images

Leveraging weak complementary labels to improve semantic segmentation of hepatocellular carcinoma and cholangiocarcinoma in H&E-stained slides

Weakly Supervised Airway Orifice Segmentation in Video Bronchoscopy

Hierarchical segmentation of surgical scenes in laparoscopy