Bounding Box Embedding for Single Shot Person Instance Segmentation

Jacob Richeimer,Jonathan Mitchell
DOI: https://doi.org/10.48550/arXiv.1807.07674
2018-07-20
Computer Vision and Pattern Recognition
Abstract:We present a bottom-up approach for the task of object instance segmentation using a single-shot model. The proposed model employs a fully convolutional network which is trained to predict class-wise segmentation masks as well as the bounding boxes of the object instances to which each pixel belongs. This allows us to group object pixels into individual instances. Our network architecture is based on the DeepLabv3+ model, and requires only minimal extra computation to achieve pixel-wise instance assignments. We apply our method to the task of person instance segmentation, a common task relevant to many applications. We train our model with COCO data and report competitive results for the person class in the COCO instance segmentation task.
What problem does this paper attempt to address?