Sequential Rib Labeling and Segmentation in Chest X-Ray using Mask R-CNN

Jöran Wessel,Mattias P. Heinrich,Jens von Berg,Astrid Franz,Axel Saalbach
DOI: https://doi.org/10.48550/arXiv.1908.08329
2019-08-22
Abstract:Mask R-CNN is a state-of-the-art network architecture for the detection and segmentation of object instances in the computer vision domain. In this contribution, it is used to localize, label and segment individual ribs in anterior-posterior chest X-ray images. For this purpose, several extensions have been made to the original architecture, in order to address the specific challenges of this application. This includes the use of rib specific networks, facilitating dedicated anchor boxes sampled from a training set, as well as a sequential processing of all ribs. Here, the segmentation result of the upper neighbor rib is used as additional input to the network. This approach is the first addressing both rib segmentation and anatomical labeling in chest radiographs. The results are comparable or even better than existing methods aiming only at segmentation.
Image and Video Processing
What problem does this paper attempt to address?
This paper attempts to solve the problem of simultaneously performing rib detection and segmentation in chest X - ray images. Specifically, the author uses Mask R - CNN, an advanced network architecture, and makes several extensions to it to address the specific challenges in rib detection and segmentation tasks. These challenges include the high self - similarity among ribs and the need to accurately locate and label each rib. By introducing rib - specific networks, dedicated anchor boxes, and a method of sequentially processing all ribs, this research aims to provide a solution that can simultaneously achieve rib segmentation and anatomical labeling. This is the first time that a method can solve these two problems simultaneously, and its performance is comparable to or better than that of existing methods that only focus on segmentation.