Unsupervised Segmentation of Colonoscopy Images

Heming Yao,Jérôme Lüscher,Benjamin Gutierrez Becker,Josep Arús-Pous,Tommaso Biancalani,Amelie Bigorgne,David Richmond
2023-12-20
Abstract:Colonoscopy plays a crucial role in the diagnosis and prognosis of various gastrointestinal diseases. Due to the challenges of collecting large-scale high-quality ground truth annotations for colonoscopy images, and more generally medical images, we explore using self-supervised features from vision transformers in three challenging tasks for colonoscopy images. Our results indicate that image-level features learned from DINO models achieve image classification performance comparable to fully supervised models, and patch-level features contain rich semantic information for object detection. Furthermore, we demonstrate that self-supervised features combined with unsupervised segmentation can be used to discover multiple clinically relevant structures in a fully unsupervised manner, demonstrating the tremendous potential of applying these methods in medical image analysis.
Image and Video Processing,Artificial Intelligence,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to explore the application of unsupervised segmentation methods in colonoscopy image analysis. The specific objectives include: 1. **Unsupervised Semantic Segmentation**: Achieving unsupervised discovery of clinically relevant structures in colonoscopy images through self-supervised learning (SSL) combined with vision transformers (ViT). 2. **Addressing Annotation Challenges**: Due to the difficulty in obtaining large-scale high-quality annotated datasets, researchers attempt to overcome this challenge by utilizing self-supervised features. 3. **Multi-task Performance Evaluation**: Evaluating the effectiveness of self-supervised methods through three challenging tasks: image classification, object detection, and mucosal feature discovery. 4. **Mucosal Feature Recognition**: Exploring the potential of unsupervised methods in the automatic identification and quantification of mucosal features in colonoscopy videos. ### Main Findings - **Image Classification**: ViT trained based on the DINO framework performed excellently in image classification tasks, especially for the 23-class classification task and the Mayo Clinic Endoscopic Score (MCES) three-class classification task. - **Object Detection**: Self-supervised features also performed well in detecting polyps in colonoscopy images, showing competitiveness compared to fully supervised methods. - **Mucosal Feature Discovery**: The unsupervised segmentation method successfully discovered various interpretable and clinically relevant mucosal features, demonstrating the great potential of this approach in medical image analysis.