CathAction: A Benchmark for Endovascular Intervention Understanding

Baoru Huang, Tuan Vo, Chayun Kongtongvattana, Giulio Dagnino, Dennis Kundrat, Wenqiang Chi, Mohamed Abdelaziz, Trevor Kwok, Tudor Jianu, Tuong Do, Hieu Le, Minh Nguyen, Hoan Nguyen, Erman Tjiputra, Quang Tran, Jianyang Xie, Yanda Meng, Binod Bhattarai, Zhaorui Tan, Hongbin Liu, Hong Seng Gan, Wei Wang, Xi Yang, Qiufeng Wang, Jionglong Su, Kaizhu Huang, Angelos Stefanidis, Min Guo, Bo Du, Rong Tao, Minh Vu, Guoyan Zheng, Yalin Zheng, Francisco Vasconcelos, Danail Stoyanov, Daniel Elson, Ferdinando Rodriguez y Baena, Anh Nguyen
2024-08-30
Abstract: Real-time visual feedback from catheterization analysis is crucial for enhancing surgical safety and efficiency during endovascular interventions. However, existing datasets are often limited to specific tasks, small in scale, and lacking the comprehensive annotations necessary for broader endovascular intervention understanding. To tackle these limitations, we introduce CathAction, a large-scale dataset for catheterization understanding. Our CathAction dataset encompasses approximately 500,000 annotated frames for catheterization action understanding and collision detection, and 25,000 ground truth masks for catheter and guidewire segmentation. For each task, we benchmark recent related works in the field. We further discuss the challenges of endovascular interventions compared to traditional computer vision tasks and point out open research questions. We hope that CathAction will facilitate the development of endovascular intervention understanding methods that can be applied to real-world applications. The dataset is available at: https://airvlab.github.io/cathaction/
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the limitations of existing endovascular intervention datasets, which hold back the development and application of deep learning methods in this field. Specifically, the paper identifies the following problems:

1. **Small scale**: Because collecting real-world medical data is costly, existing datasets usually contain only a small number of images.
2. **Single task**: Most existing datasets target one specific task (such as segmentation) and cannot support other important tasks (such as collision detection or action understanding).
3. **Private data**: Because of privacy constraints in the medical field, most existing endovascular intervention datasets are private and not easily accessible.
4. **Lack of comprehensive annotations**: Existing datasets lack annotations spanning multiple tasks, which limits their use in broader applications.

To address these problems, the authors propose CathAction, a large-scale endovascular intervention dataset that covers multiple tasks: segmentation, collision detection, and action understanding. By providing a large amount of annotated data, CathAction is expected to advance endovascular intervention understanding methods and support their use in practice.

### Main contributions

1. **Introduction of the CathAction dataset**: The dataset provides manually annotated real data covering multiple tasks, including segmentation, action understanding, and collision detection.
2. **Benchmark testing**: Benchmarks were run for key tasks in endovascular intervention, including catheter insertion prediction, identification, segmentation, and collision detection.
3. **Discussion of challenges and open questions**: The paper explores the challenges of endovascular intervention and points out future research directions. The code and dataset are publicly available.

### Dataset features

- **Large scale**: It contains approximately 500,000 annotated frames for action understanding and collision detection, and approximately 25,000 ground-truth masks for catheter and guidewire segmentation.
- **Diversity**: The data come from simulation models (phantoms) and real animal experiments, which adds diversity and realism.
- **Multi-task support**: It supports not only segmentation but also action understanding and collision detection.

With these features, the CathAction dataset supports the understanding and analysis of endovascular interventions, which helps improve surgical safety and efficiency.