Comparing artificial intelligence to humans for endoscopic diagnosis of gastric neoplasia: An external validation study

Sabrina Xin Zi Quek,Jonathan W. J. Lee,Zhu Feng,Min Min Soh,Masayuki Tokano,Yeoh Khay Guan,Jimmy B. Y. So,Tomohiro Tada,Calvin J. Koh
DOI: https://doi.org/10.1111/jgh.16274
2023-07-06
Journal of Gastroenterology and Hepatology
Abstract:Objectives Artificial intelligence (AI) uses deep learning functionalities that may enhance the detection of early gastric cancer during endoscopy. An AI‐based endoscopic system for upper endoscopy was recently developed in Japan. We aim to validate this AI‐based system in a Singaporean cohort. Methods There were 300 de‐identified still images prepared from endoscopy video files obtained from subjects that underwent gastroscopy in National University Hospital (NUH). Five specialists and 6 non‐specialists (trainees) from NUH were assigned to read and categorize the images into "neoplastic" or "non‐neoplastic." Results were then compared with the readings performed by the endoscopic AI system. Results The mean accuracy, sensitivity, and specificity for the 11 endoscopists were 0.847, 0.525, and 0.872, respectively. These values for the AI‐based system were 0.777, 0.591, and 0.791, respectively. While AI in general did not perform better than endoscopists on the whole, in the subgroup of high‐grade dysplastic lesions, only 29.1% were picked up by the endoscopist rating, but 80% were classified as neoplastic by AI (P = 0.0011). The average diagnostic time was also faster in AI compared with endoscopists (677.1 s vs 42.02 s (P
gastroenterology & hepatology
What problem does this paper attempt to address?