Dpnet: End-to-end Aerial Image Segmentation Via Deformable Point Network.

Yiyou Guo,Zheyun Qin,Yikun Liu,Yongtai Yang,Xiankai Lu,Huan Xie,Xiaohua Tong
DOI: https://doi.org/10.1109/igarss46834.2022.9883974
2022-01-01
Abstract:Aerial image Segmentation segmentation faces intrinsic foreground-background imbalance and background clutter distraction. To guide the segmentation model to learn more discriminative foreground ability and more invariant back-ground representation features, we design a Deformable Point Network (DPNet). It is an end-to-end segmentation network and consists of a multi-head deformable attention module that simultaneously considers foreground object information and background suppression. Specifically, we first employ a feature pyramid network to aggregate multiple-layer features to handle scale variants. And then, we further investigate deformable convolution to select some representative points for each layer and propose a differential module to implement it automatically instead of traditional dense fusion. Moreover, we incorporate the multi-head mechanism in the feature fusion to focus on the key contents from different representation regions. Experimental results on the representative iSAID, Vaihingen, and Postdam datasets demonstrate that our DPNet achieves competitive performance. Also, the multiple-head deformable attention facilitates the network convergence significantly.
What problem does this paper attempt to address?