Automatic assessment of bowel preparation by an artificial intelligence model and its clinical applicability

Ji Young Lee,Jooyoung Park,Hyo Jeong Lee,Hana Park,Eun Hyo Jin,Kanggil Park,Ji Eun Baek,Dong‐Hoon Yang,Seung Wook Hong,Namkug Kim,Jeong‐Sik Byeon
DOI: https://doi.org/10.1111/jgh.16618
2024-05-22
Journal of Gastroenterology and Hepatology
Abstract:Background and Aim Reliable bowel preparation assessment is important in colonoscopy. However, current scoring systems are limited by laborious and time‐consuming tasks and interobserver variability. We aimed to develop an artificial intelligence (AI) model to assess bowel cleanliness and evaluate its clinical applicability. Methods A still image‐driven AI model to assess the Boston Bowel Preparation Scale (BBPS) was developed and validated using 2361 colonoscopy images. For evaluating real‐world applicability, the model was validated using 113 10‐s colonoscopy video clips and 30 full colonoscopy videos to identify "adequate (BBPS 2–3)" or "inadequate (BBPS 0–1)" preparation. The model was tested with an external dataset of 29 colonoscopy videos. The clinical applicability of the model was evaluated using 225 consecutive colonoscopies. Inter‐rater variability was analyzed between the AI model and endoscopists. Results The AI model achieved an accuracy of 94.0% and an area under the receiver operating characteristic curve of 0.939 with the still images. Model testing with an external dataset showed an accuracy of 95.3%, an area under the receiver operating characteristic curve of 0.976, and a sensitivity of 100% for the detection of inadequate preparations. The clinical applicability study showed an overall agreement rate of 85.3% between endoscopists and the AI model, with Fleiss' kappa of 0.686. The agreement rate was lower for the right colon compared with the transverse and left colon, with Fleiss' kappa of 0.563, 0.575, and 0.789, respectively. Conclusions The AI model demonstrated accurate bowel preparation assessment and substantial agreement with endoscopists. Further refinement of the AI model is warranted for effective monitoring of qualified colonoscopy in large‐scale screening programs.
gastroenterology & hepatology
What problem does this paper attempt to address?