Abstract:Real-time visual feedback from catheterization analysis is crucial for enhancing surgical safety and efficiency during endovascular interventions. However, existing datasets are often limited to specific tasks, small scale, and lack the comprehensive annotations necessary for broader endovascular intervention understanding. To tackle these limitations, we introduce CathAction, a large-scale dataset for catheterization understanding. Our CathAction dataset encompasses approximately 500,000 annotated frames for catheterization action understanding and collision detection, and 25,000 ground truth masks for catheter and guidewire segmentation. For each task, we benchmark recent related works in the field. We further discuss the challenges of endovascular intentions compared to traditional computer vision tasks and point out open research questions. We hope that CathAction will facilitate the development of endovascular intervention understanding methods that can be applied to real-world applications. The dataset is available at <a class="link-external link-https" href="https://airvlab.github.io/cathaction/" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the limitations of the existing endovascular interventional datasets. These limitations include small scale, single - task (such as being limited to segmentation only), lack of comprehensive annotations, and most datasets being private, etc. These problems limit the application and development of deep - learning methods in this field. Specifically, the paper points out that the current endovascular interventional datasets have the following problems: 1. **Small scale**: Due to the high cost of collecting real - world medical data, the existing datasets usually contain a small number of images. 2. **Single - task**: Most of the existing datasets are for specific tasks (such as segmentation) and cannot support other important tasks (such as collision detection or action understanding). 3. **Data privatization**: Due to the privacy challenges in the medical field, most of the existing endovascular interventional datasets are private and not easily accessible. 4. **Lack of comprehensive annotations**: The existing datasets lack comprehensive annotations for multiple tasks, which limits their use in broader applications. To address these problems, the authors propose the CathAction dataset, which is a large - scale endovascular interventional dataset aiming to cover multiple tasks, such as segmentation, collision detection, and action understanding. By providing a large amount of annotated data, CathAction is expected to promote the development of endovascular interventional understanding methods and drive the application of these methods in practical applications. ### Main contributions 1. **Introduction of the CathAction dataset**: This dataset provides manually - annotated real - data, covering multiple tasks such as segmentation, action understanding, and collision detection. 2. **Benchmark testing**: Benchmark tests were carried out for key tasks in endovascular interventional, including catheter insertion prediction, identification, segmentation, and collision detection. 3. **Discussion of challenges and open questions**: The challenges in the field of endovascular interventional were explored, and future research directions were pointed out. The code and dataset are publicly available. ### Dataset features - **Large scale**: It contains approximately 500,000 annotated frames for action understanding and collision detection, and approximately 25,000 ground - truth masks for catheter and guide - wire segmentation. - **Diversity**: The data are from simulation models (phantom) and real - animal experiments, ensuring the diversity and realism of the data. - **Multi - task support**: It not only supports the segmentation task, but also supports the action understanding and collision detection tasks. Through these improvements, the CathAction dataset provides strong support for the understanding and analysis of endovascular interventional, which helps to improve the safety and efficiency of surgeries.

CathAction: A Benchmark for Endovascular Intervention Understanding

CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers

CathSim: An Open-source Simulator for Endovascular Intervention

ImageCAS: A Large-Scale Dataset and Benchmark for Coronary Artery Segmentation based on Computed Tomography Angiography Images

Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection

Cataract-1K Dataset for Deep-Learning-Assisted Analysis of Cataract Surgery Videos

ESAD: Endoscopic Surgeon Action Detection Dataset

Autonomous Catheterization with Open-source Simulator and Expert Trajectory

Auxiliary Input in Training: Incorporating Catheter Features into Deep Learning Models for ECG-Free Dynamic Coronary Roadmapping

OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding

OCTA-500: A Retinal Dataset for Optical Coherence Tomography Angiography Study

CATARACTS: Challenge on automatic tool annotation for cataRACT surgery

CADICA: a new dataset for coronary artery disease detection by using invasive coronary angiography

SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge

2020 CATARACTS Semantic Segmentation Challenge

Phase-Informed Tool Segmentation for Manual Small-Incision Cataract Surgery

CaDIS: Cataract Dataset for Image Segmentation

The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark

CathAI: Fully Automated Interpretation of Coronary Angiograms Using Neural Networks

AiAReSeg: Catheter Detection and Segmentation in Interventional Ultrasound using Transformers