SATMask: Spatial Attention Transform Mask for Dense Instance Segmentation.

Quanzhong Mao,Lijuan Sun,Jingchen Wu,Yutong Gao,Xu Wu,Lirong Qiu
DOI: https://doi.org/10.1109/dsc55868.2022.00089
2022-01-01
Abstract:There are often dense objects in the images processed by instance segmentation, but too dense objects will cause the problem that the objects are difficult to segment. Most of the current dense instance segmentation methods are based on dense sliding window such as TensorMask. However, the sliding window has the problems of high computation and difficult design of anchor. In order to solve the above difficulties, we propose an anchor-free and single shot dense image segmentation framework, named SATMask, which adds a Spatial Attention Transform (SAT) mask head on anchor-free one stage object detector (FCOS) to predict high quality instance mask with low complexity, and uses feature-aligned pyramid network to fuse the feature map generated by backbone to obtain rich spatial details and better semantic information. Extensive experiments on the challenging COCO and Cityscapes datasets demonstrate the effectiveness of SATMask. In particular, under the same backbone (ResNet-101), SATMask achieves 39.8% AP on COCO, surpassing the state-of-the-art instance segmentation method Mask R-CNN 1.5% AP.
What problem does this paper attempt to address?