Abstract:The proliferation of video capsule endoscopy (VCE) would not have been possible without continued technological improvements in imaging and locomotion. Advancements in imaging include both software and hardware improvements but perhaps the greatest software advancement in imaging comes in the form of artificial intelligence (AI). Current research into AI in VCE includes the diagnosis of tumors, gastrointestinal bleeding, Crohn's disease, and celiac disease. Other advancements have focused on the improvement of both camera technologies and alternative forms of imaging. Comparatively, advancements in locomotion have just started to approach clinical use and include onboard controlled locomotion, which involves miniaturizing a motor to incorporate into the video capsule, and externally controlled locomotion, which involves using an outside power source to maneuver the capsule itself. Advancements in locomotion hold promise to remove one of the major disadvantages of VCE, namely, its inability to obtain targeted diagnoses. Active capsule control could in turn unlock additional diagnostic and therapeutic potential, such as the ability to obtain targeted tissue biopsies or drug delivery. With both advancements in imaging and locomotion has come a corresponding need to be better able to process generated images and localize the capsule's position within the gastrointestinal tract. Technological advancements in computation performance have led to improvements in image compression and transfer, as well as advancements in sensor detection and alternative methods of capsule localization. Together, these advancements have led to the expansion of VCE across a number of indications, including the evaluation of esophageal and colon pathologies including esophagitis, esophageal varices, Crohn's disease, and polyps after incomplete colonoscopy. Current research has also suggested a role for VCE in acute gastrointestinal bleeding throughout the gastrointestinal tract, as well as in urgent settings such as the emergency department, and in resource-constrained settings, such as during the COVID-19 pandemic. VCE has solidified its role in the evaluation of small bowel bleeding and earned an important place in the practicing gastroenterologist's armamentarium. In the next few decades, further improvements in imaging and locomotion promise to open up even more clinical roles for the video capsule as a tool for non-invasive diagnosis of lumenal gastrointestinal pathologies.

Galar - a large multi-label video capsule endoscopy dataset

Real-Time Multi-Label Upper Gastrointestinal Anatomy Recognition from Gastroscope Videos

Kvasir-Capsule, a video capsule endoscopy dataset

GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease Detection

HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy

The intersection of video capsule endoscopy and artificial intelligence: addressing unique challenges using machine learning

REAL-Colon: A dataset for developing real-world AI applications in colonoscopy

Domain-Adaptive Pre-training of Self-Supervised Foundation Models for Medical Image Classification in Gastrointestinal Endoscopy

Efficient disease detection in gastrointestinal videos – global features versus neural networks

Automated Detection of Small Bowel Lesions Based on Capsule Endoscopy Using Deep Learning Algorithm

PS-DeVCEM: Pathology-sensitive deep learning model for video capsule endoscopy based on weakly labeled data

Endoscopic capsule robot-based diagnosis, navigation and localization in the gastrointestinal tract

Multi-Class Abnormality Classification Task in Video Capsule Endoscopy

Celiac Disease Diagnosis from Videocapsule Endoscopy Images with Residual Learning and Deep Feature Extraction.

Reduction of Video Capsule Endoscopy Reading Times Using Deep Learning with Small Data

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Video Capsule Endoscopy in Gastroenterology

ViTCA-Net: a framework for disease detection in video capsule endoscopy images using a vision transformer and convolutional neural network with a specific attention mechanism

Capsule Vision 2024 Challenge: Multi-Class Abnormality Classification for Video Capsule Endoscopy

Fast machine learning annotation in the medical domain: a semi-automated video annotation tool for gastroenterologists