Abstract:Objective: The computation of anatomical information and laparoscope position is a fundamental block of surgical navigation in Minimally Invasive Surgery (MIS). Recovering a dense 3D structure of surgical scene using visual cues remains a challenge, and the online laparoscopic tracking primarily relies on external sensors, which increases system complexity. Methods: Here, we propose a learning-driven framework, in which an image-guided laparoscopic localization with 3D reconstructions of complex anatomical structures is obtained. To reconstruct the 3D structure of the whole surgical environment, we first fine-tune a learning-based stereoscopic depth perception method, which is robust to the texture-less and variant soft tissues, for depth estimation. Then, we develop a dense visual reconstruction algorithm to represent the scene by surfels, estimate the laparoscope poses and fuse the depth maps into a unified reference coordinate for tissue reconstruction. To estimate poses of new laparoscope views, we achieve a coarse-to-fine localization method, which incorporates our reconstructed 3D model. Results: We evaluate the reconstruction method and the localization module on three datasets, namely, the stereo correspondence and reconstruction of endoscopic data (SCARED), the ex-vivo phantom and tissue data collected with Universal Robot (UR) and Karl Storz Laparoscope, and the in-vivo DaVinci robotic surgery dataset, where the reconstructed 3D structures have rich details of surface texture with an accuracy error under 1.71 mm and the localization module can accurately track the laparoscope with only images as input. Conclusions: Experimental results demonstrate the superior performance of the proposed method in 3D anatomy reconstruction and laparoscopic localization. Significance: The proposed framework can be potentially extended to the current surgical navigation system.

Unsupervised-Learning-Based Continuous Depth and Motion Estimation with Monocular Endoscopy for Virtual Reality Minimally Invasive Surgery

Distilled Visual and Robot Kinematics Embeddings for Metric Depth Estimation in Monocular Scene Reconstruction

Self-Supervised Siamese Learning on Stereo Image Pairs for Depth Estimation in Robotic Surgery

Joint estimation of depth and motion from a monocular endoscopy image sequence using a multi-loss rebalancing network

Self-Supervised Monocular Depth Estimation for Endoscopic Imaging

Stereo Dense Scene Reconstruction and Accurate Localization for Learning-Based Navigation of Laparoscope in Minimally Invasive Surgery

Self-Supervised Learning for Monocular Depth Estimation on Minimally Invasive Surgery Scenes

Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors

MonoLoT: Self-Supervised Monocular Depth Estimation in Low-Texture Scenes for Automatic Robotic Endoscopy

Augmented Reality for Depth Cues in Monocular Minimally Invasive Surgery

Stereo Video Reconstruction Without Explicit Depth Maps for Endoscopic Surgery

Self-supervised monocular depth estimation for gastrointestinal endoscopy

Image Intrinsic-Based Unsupervised Monocular Depth Estimation in Endoscopy

Self-supervised neural network-based endoscopic monocular 3D reconstruction method

Unsupervised Monocular Depth Estimation for Monocular Visual SLAM Systems

Self-supervised monocular depth estimation for high field of view colonoscopy cameras

Self-supervised Monocular Depth Estimation with 3D Displacement Module for Laparoscopic Images

Long term and robust 6DoF motion tracking for highly dynamic stereo endoscopy videos

A deep learning framework for real-time 3D model registration in robot-assisted laparoscopic surgery

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy