Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning

Harry Robertshaw, Lennart Karstensen, Benjamin Jackson, Alejandro Granados, Thomas C. Booth
DOI: https://doi.org/10.1007/s11548-024-03208-w
2024-06-18
Abstract: Purpose: Autonomous navigation of catheters and guidewires can enhance endovascular surgery safety and efficacy, reducing procedure times and operator radiation exposure. Integrating tele-operated robotics could widen access to time-sensitive emergency procedures like mechanical thrombectomy (MT). Reinforcement learning (RL) shows potential in endovascular navigation, yet its application encounters challenges without a reward signal. This study explores the viability of autonomous navigation in MT vasculature using inverse RL (IRL) to leverage expert demonstrations. Methods: This study established a simulation-based training and evaluation environment for MT navigation. We used IRL to infer reward functions from expert behaviour when navigating a guidewire and catheter. We utilized soft actor-critic to train models with various reward functions and compared their performance in silico. Results: We demonstrated feasibility of navigation using IRL. When evaluating single versus dual device (i.e. guidewire versus catheter and guidewire) tracking, both methods achieved high success rates of 95% and 96%, respectively. Dual-tracking, however, utilized both devices mimicking an expert. A success rate of 100% and procedure time of 22.6 s were obtained when training with a reward function obtained through reward shaping. This outperformed a dense reward function (96%, 24.9 s) and an IRL-derived reward function (48%, 59.2 s). Conclusions: We have contributed to the advancement of autonomous endovascular intervention navigation, particularly MT, by employing IRL. The results underscore the potential of using reward shaping to train models, offering a promising avenue for enhancing the accessibility and precision of MT. We envisage that future research can extend our methodology to diverse anatomical structures to enhance generalizability.
Machine Learning, Artificial Intelligence, Robotics
What problem does this paper attempt to address?
The paper addresses autonomous navigation of catheters and guidewires in mechanical thrombectomy (MT), with the aim of improving the safety and efficacy of the procedure, shortening procedure times, and reducing operator radiation exposure. Integrating tele-operated robotics could widen access to time-sensitive emergency procedures such as MT. However, applying reinforcement learning (RL) to endovascular navigation is difficult when no reward signal is available. The study therefore explores whether inverse reinforcement learning (IRL) can infer reward functions from expert demonstrations and so enable autonomous guidewire and catheter navigation. Specifically, the paper approaches the problem through the following points:

1. **Establishing a Simulation Environment**: A simulation-based training and evaluation environment for MT navigation was created using the Simulation Open Framework Architecture (SOFA).
2. **Inferring Reward Functions Using IRL**: Reward functions were inferred from expert behaviour when navigating a guidewire and catheter.
3. **Model Training and Comparison**: Models with different reward functions were trained using the Soft Actor-Critic (SAC) algorithm, and their performance was compared in the simulation environment.
4. **Evaluating Different Navigation Methods**: Single-device (guidewire only) and dual-device (guidewire and catheter) tracking were compared. Both achieved high success rates (95% and 96%, respectively), with dual-device tracking using both devices in a way that mimics an expert.

The results indicate that combining reward shaping with the dense and IRL-inferred reward functions gave the best performance: a 100% success rate and a procedure time of 22.6 s. This outperformed the dense reward function on its own (96%, 24.9 s) and the IRL-derived reward function on its own (48%, 59.2 s). The authors present this as a promising route to improving the accessibility and precision of MT.
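
The difference between a dense reward and a shaped reward can be made concrete with a small sketch. The example below assumes potential-based shaping, one common form of reward shaping; the summary does not state which form the paper uses, so this is not the authors' implementation. All constants, function names, and the stand-in potential (which in practice might come from an IRL-learned value estimate) are hypothetical.

```python
# Illustrative sketch only: constants, names, and the potential function are
# assumptions for this example and are not taken from the paper.
from typing import Callable, Sequence

STEP_PENALTY = 0.005    # hypothetical cost per simulation step
PROGRESS_SCALE = 0.001  # hypothetical weight on distance gained toward the target
TARGET_BONUS = 1.0      # hypothetical bonus when the target is reached


def dense_reward(prev_dist: float, dist: float, reached: bool) -> float:
    """Dense reward: step penalty, progress toward the target, terminal bonus."""
    reward = -STEP_PENALTY + PROGRESS_SCALE * (prev_dist - dist)
    if reached:
        reward += TARGET_BONUS
    return reward


def shaped_reward(base: float, phi_s: float, phi_next: float,
                  gamma: float = 0.99) -> float:
    """Potential-based shaping: add gamma * phi(s') - phi(s) to a base reward."""
    return base + gamma * phi_next - phi_s


def shaped_return(dists: Sequence[float],
                  potential: Callable[[float], float],
                  gamma: float = 0.99,
                  reach_tol: float = 1.0) -> float:
    """Discounted return of the shaped reward along a trajectory of distances."""
    total = 0.0
    for t in range(len(dists) - 1):
        prev, cur = dists[t], dists[t + 1]
        base = dense_reward(prev, cur, cur <= reach_tol)
        total += (gamma ** t) * shaped_reward(
            base, potential(prev), potential(cur), gamma
        )
    return total


if __name__ == "__main__":
    # Hypothetical device-tip distances to the target over four time steps.
    trajectory = [10.0, 8.0, 5.0, 0.5]
    # Stand-in potential: higher (less negative) when closer to the target.
    phi = lambda d: -0.01 * d
    print(shaped_return(trajectory, phi))
```

With `gamma = 1.0`, the shaping terms telescope, so the shaped return differs from the dense return only by `phi(final) - phi(initial)`. This is the reason potential-based shaping can speed up learning without changing which policy is optimal.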