CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers

Alex Ranne,Liming Kuang,Yordanka Velikova,Nassir Navab,Ferdinando Rodriguez y Baena
2024-09-10
Abstract:In minimally invasive endovascular procedures, contrast-enhanced angiography remains the most robust imaging technique. However, it is at the expense of the patient and clinician's health due to prolonged radiation exposure. As an alternative, interventional ultrasound has notable benefits such as being radiation-free, fast to deploy, and having a small footprint in the operating room. Yet, ultrasound is hard to interpret, and highly prone to artifacts and noise. Additionally, interventional radiologists must undergo extensive training before they become qualified to diagnose and treat patients effectively, leading to a shortage of staff, and a lack of open-source datasets. In this work, we seek to address both problems by introducing a self-supervised deep learning architecture to segment catheters in longitudinal ultrasound images, without demanding any labeled data. The network architecture builds upon AiAReSeg, a segmentation transformer built with the Attention in Attention mechanism, and is capable of learning feature changes across time and space. To facilitate training, we used synthetic ultrasound data based on physics-driven catheter insertion simulations, and translated the data into a unique CT-Ultrasound common domain, CACTUSS, to improve the segmentation performance. We generated ground truth segmentation masks by computing the optical flow between adjacent frames using FlowNet2, and performed thresholding to obtain a binary map estimate. Finally, we validated our model on a test dataset, consisting of unseen synthetic data and images collected from silicon aorta phantoms, thus demonstrating its potential for applications to clinical data in the future.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to use ultrasonic imaging instead of radiographic imaging techniques in minimally invasive endovascular surgery, in order to reduce the risk of patients and doctors being exposed to radiation, while improving the accuracy of catheter segmentation in ultrasonic images. Specifically, the paper proposes a self - supervised deep - learning architecture (CathFlow), which aims to automatically segment catheters in longitudinal ultrasonic images without the need for labeled data. This method utilizes the optical flow method and the Transformer model, promotes training through synthetic ultrasonic data and physically - driven catheter insertion simulations, and improves segmentation performance through a general CT - ultrasonic domain conversion technique called CACTUSS. The core challenges of the paper are as follows: 1. **Reducing radiation exposure**: In minimally invasive endovascular surgery, radiographic imaging techniques (such as fluoroscopy) are usually used to guide the surgery, but this method will increase the risk of radiation exposure for patients and doctors. 2. **Improving the interpretability of ultrasonic images**: Although ultrasound is a radiation - free imaging technique, it is difficult to interpret, is easily affected by artifacts and noise, and the image quality depends on the operator's experience and machine settings. 3. **Self - supervised learning**: Since it is very difficult to obtain a large amount of labeled ultrasonic data, the paper proposes a self - supervised method, which generates the labels required for training through synthetic data and physical simulations, thus avoiding the need for manual labeling. Through these methods, the paper aims to develop a self - supervised deep - learning model that can accurately segment catheters in ultrasonic images, and finally apply it to clinical data to improve the safety and efficiency of minimally invasive endovascular surgery.