Abstract:BACKGROUND & AIMS: Capsule endoscopy has revolutionized investigation of the small bowel. However, this technique produces a video that is 8-10 hours long, so analysis is time consuming for gastroenterologists. Deep convolutional neural networks (CNNs) can recognize specific images among a large variety. We aimed to develop a CNN-based algorithm to assist in the evaluation of small bowel capsule endoscopy (SB-CE) images. METHODS: We collected 113,426,569 images from 6970 patients who had SB-CE at 77 medical centers from July 2016 through July 2018. A CNN-based auxiliary reading model was trained to differentiate abnormal from normal images using 158,235 SB-CE images from 1970 patients. Images were categorized as normal, inflammation, ulcer, polyps, lymphangiectasia, bleeding, vascular disease, protruding lesion, lymphatic follicular hyperplasia, diverticulum, parasite, and other. The model was further validated in 5000 patients (no patient was overlap with the 1970 patients in the training set); the same patients were evaluated by conventional analysis and CNN-based auxiliary analysis by 20 gastroenterologists. If there was agreement in image categorization between the conventional analysis and CNN model, no further evaluation was performed. If there was disagreement between the conventional analysis and CNN model, the gastroenterologists re-evaluated the image to confirm or reject the CNN categorization. RESULTS: In the SB-CE images from the validation set, 4206 abnormalities in 3280 patients were identified after final consensus evaluation. The CNN-based auxiliary model identified abnormalities with 99.88% sensitivity in the per-patient analysis (95% CI, 99.67-99.96) and 99.90% sensitivity in the per-lesion analysis (95% CI, 99.74-99.97). Conventional reading by the gastroenterologists identified abnormalities with 74.57% sensitivity (95% CI, 73.05-76.03) in the per-patient analysis and 76.89% in the per-lesion analysis (95% CI, 75.58-78.15). The mean reading time per patient was 96.6 +/- 22.53 minutes by conventional reading and 5.9 +/- 2.23 minutes by CNN-based auxiliary reading (P < .001). CONCLUSIONS: We validated the ability of a CNN-based algorithm to identify abnormalities in SB-CE images. The CNN-based auxiliary model identified abnormalities with higher levels of sensitivity and significantly shorter reading times than conventional analysis by gastroenterologists. This algorithm provides an important tool to help gastroenterologists analyze SB-CE images more efficiently and more accurately.

Classification of Endoscopy and Video Capsule Images using CNN-Transformer Model

Capsule Endoscopy Multi-classification via Gated Attention and Wavelet Transformations

Vision Transformer for Efficient Chest X-ray and Gastrointestinal Image Classification

Advanced CNN models in gastric cancer diagnosis: enhancing endoscopic image analysis with deep transfer learning

Convolutional‐capsule network for gastrointestinal endoscopy image classification

Multi-Class Abnormality Classification Task in Video Capsule Endoscopy

Video Capsule Endoscopy Classification using Focal Modulation Guided Convolutional Neural Network

Multi-Class Abnormality Classification in Video Capsule Endoscopy Using Deep Learning

Gastroenterologist-Level Identification of Small-Bowel Diseases and Normal Variants by Capsule Endoscopy Using a Deep-Learning Model

Early Cervical Cancer Diagnosis with SWIN-Transformer and Convolutional Neural Networks

CG‐Net: A novel CNN framework for gastrointestinal tract diseases classification

CapsuleNet: A Deep Learning Model To Classify GI Diseases Using EfficientNet-b7

ViTCA-Net: a framework for disease detection in video capsule endoscopy images using a vision transformer and convolutional neural network with a specific attention mechanism

Will Transformers change gastrointestinal endoscopic image analysis? A comparative analysis between CNNs and Transformers, in terms of performance, robustness and generalization

Endoscopy Classification Model Using Swin Transformer and Saliency Map

Pathological Insights: Enhanced Vision Transformers for the Early Detection of Colorectal Cancer

Detection of Angioectasias and Haemorrhages Incorporated into a Multi-Class Classification Tool for the GI Tract Anomalies by Using Binary CNNs

A Hybrid Convolutional Neural Network with Meta Feature Learning for Abnormality Detection in Wireless Capsule Endoscopy Images

Deep learning-based prediction model for diagnosing gastrointestinal diseases using endoscopy images

Automated Identification of Human Gastrointestinal Tract Abnormalities Based on Deep Convolutional Neural Network with Endoscopic Images

Gastrointestinal Disease Classification in Endoscopic Images Using Attention-Guided Convolutional Neural Networks