Endo3d: Online Workflow Analysis For Endoscopic Surgeries Based On 3d Cnn And Lstm

Weixiang Chen,Jianjiang Feng,Jiwen Lu,Jie Zhou
DOI: https://doi.org/10.1007/978-3-030-01201-4_12
2018-01-01
Abstract:Surgical workflow analysis is an important topic of computer-assisted intervention and phase recognition is one of its important tasks. Features extracted from video frames by 2D convolutional networks were proved feasible for online phase analysis in former publications. In this paper, we propose to extract fine-level temporal features from video clips using 3D convolutional networks (CNN) and use Long Short-Term Memory (LSTM) networks to capture coarse-level information. By combining fine-level and coarse-level information, our proposed method outperforms state-of-the-art online methods without using specific knowledge of surgeries and almost reaches the state-of-the-art offline performance.
What problem does this paper attempt to address?