Multiscale Matters for Part Segmentation of Instruments in Robotic Surgery.

Wenhao He, Haitao Song, Yue Guo, Guibin Bian, Yuejie Sun, Xiaowei Zhou, Xiaonan Wang
DOI: https://doi.org/10.1049/iet-ipr.2020.0320
IF: 2.3
2020-01-01
IET Image Processing
Abstract: A challenging aspect of instrument segmentation in robotic surgery is distinguishing different parts of the same instrument. Parts with similar textures are common in practical instruments and are difficult to distinguish. In this work, the authors introduce an end-to-end recurrent model that comprises a multiscale semantic segmentation network and a refinement model. Specifically, the semantic segmentation network uniformly transforms the input images at multiple scales into a semantic mask, and the refinement model is a single-scale network that recurrently optimises this semantic mask. Through extensive experiments, the authors validate that models with multiscale inputs perform better than those that fuse encoded feature maps and those with spatial attention. Furthermore, the authors verify the effectiveness of the proposed model with state-of-the-art performance on several robotic instrument datasets derived from the MICCAI Endoscopic Vision Challenges.
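The data flow described in the abstract (inputs at several scales mapped to one mask, then a single-scale refiner applied recurrently to that mask) can be illustrated with a minimal sketch. This is not the authors' network: the functions `downsample`, `upsample`, `multiscale_mask` and `refine` are hypothetical names, the `score` and `step` callables are placeholders for learned models, and averaging the per-scale outputs is an assumption, since the abstract does not say how the scales are combined.

```python
from typing import Callable, List

Image = List[List[float]]  # 2D grid of values in [0, 1]


def downsample(img: Image, factor: int) -> Image:
    """Average-pool by an integer factor (dimensions assumed divisible)."""
    h, w = len(img), len(img[0])
    return [
        [
            sum(img[y + dy][x + dx] for dy in range(factor) for dx in range(factor))
            / (factor * factor)
            for x in range(0, w, factor)
        ]
        for y in range(0, h, factor)
    ]


def upsample(img: Image, factor: int) -> Image:
    """Nearest-neighbour upsampling by an integer factor."""
    h, w = len(img) * factor, len(img[0]) * factor
    return [[img[y // factor][x // factor] for x in range(w)] for y in range(h)]


def multiscale_mask(
    img: Image, factors: List[int], score: Callable[[Image], Image]
) -> Image:
    """Score the image at each scale, resize back, and average (assumed fusion)."""
    h, w = len(img), len(img[0])
    mask = [[0.0] * w for _ in range(h)]
    for f in factors:
        per_scale = upsample(score(downsample(img, f)), f)
        for y in range(h):
            for x in range(w):
                mask[y][x] += per_scale[y][x] / len(factors)
    return mask


def refine(
    img: Image, mask: Image, step: Callable[[Image, Image], Image], iterations: int
) -> Image:
    """Apply the same single-scale refinement step repeatedly to the mask."""
    for _ in range(iterations):
        mask = step(img, mask)
    return mask


# Placeholder models: identity scoring and a blend of mask and image.
def toy_score(im: Image) -> Image:
    return im


def toy_step(im: Image, m: Image) -> Image:
    return [[0.5 * a + 0.5 * b for a, b in zip(ri, rm)] for ri, rm in zip(im, m)]


image = [[0.0, 0.0, 1.0, 1.0] for _ in range(4)]
coarse = multiscale_mask(image, [1, 2], toy_score)
final = refine(image, coarse, toy_step, iterations=3)
```

In a real system, `toy_score` and `toy_step` would be replaced by trained networks; the sketch only shows the order of operations, with the refiner reusing one set of weights across iterations.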