Multi-Task Classification and Segmentation for Explicable Capsule Endoscopy Diagnostics

Zishang Kong,Min He,Qianjiang Luo,Xiansong Huang,Pengxu Wei,Yalu Cheng,Luyang Chen,Yongsheng Liang,Yanchang Lu,Xi Li,Jie Chen
DOI: https://doi.org/10.3389/fmolb.2021.614277
IF: 6.113
2021-01-01
Frontiers in Molecular Biosciences
Abstract:Capsule endoscopy is a leading diagnostic tool for small bowel lesions which faces certain challenges such as time-consuming interpretation and harsh optical environment inside the small intestine. Specialists unavoidably waste lots of time on searching for a high clearness degree image for accurate diagnostics. However, current clearness degree classification methods are based on either traditional attributes or an unexplainable deep neural network. In this paper, we propose a multi-task framework, called the multi-task classification and segmentation network (MTCSN), to achieve joint learning of clearness degree (CD) and tissue semantic segmentation (TSS) for the first time. In the MTCSN, the CD helps to generate better refined TSS, while TSS provides an explicable semantic map to better classify the CD. In addition, we present a new benchmark, named the Capsule-Endoscopy Crohn's Disease dataset, which introduces the challenges faced in the real world including motion blur, excreta occlusion, reflection, and various complex alimentary scenes that are widely acknowledged in endoscopy examination. Extensive experiments and ablation studies report the significant performance gains of the MTCSN over state-of-the-art methods.
What problem does this paper attempt to address?