Road and Car Extraction Using UAV Images Via Efficient Dual Contextual Parsing Network

Yueming Sun, Zhenfeng Shao, Gui Cheng, Xiao Huang, Zhongyuan Wang
DOI: https://doi.org/10.1109/tgrs.2022.3214246
IF: 8.2
2022-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract: The rapid development and commercialization of unmanned aerial vehicle (UAV) technology has made it possible to extract urban traffic information from UAV images. However, the large variations of targets in urban environments, complex foregrounds and backgrounds in cities, and severe tree and shadow occlusions pose great challenges for car and road extraction from UAV images. In this study, we propose a lightweight, efficient dual contextual parsing network (EDCPNet) to address these issues. The proposed efficient dual contextual parsing (EDCP) module in EDCPNet is mainly composed of spatial contextual parsing (SCP) and channel contextual parsing (CCP), which can effectively acquire rich contextual features in both spatial and channel dimensions, adaptively recalibrate the attention weights, perceive the salient features of targets in images, and suppress irrelevant elements. This leads to improved performance and adaptability, which facilitate practical applications of large-scale urban traffic monitoring in complex urban scenes. We conduct experiments on two benchmark datasets [UAV image dataset (UAVid) and urban drone dataset (UDD)], comparing the proposed EDCPNet with six competing methods, i.e., U-Net, PSPNet, DeepLabv3+, SegNet, ESNet, and ERFNet, and validate the effectiveness of the proposed EDCP module via extensive ablation studies. The results suggest that the proposed network outperforms all competing methods in car and road extraction from UAV images at a balanced computational cost. Its strong performance and low computational demand (only 2.37M model parameters) facilitate its deployment on edge computing devices with memory constraints.
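To give a rough sense of what 2.37M parameters means for a memory-constrained device, the short calculation below estimates the storage needed for the weights alone at three common numeric precisions. This is a back-of-the-envelope sketch, not a figure from the paper: the choice of precisions and the helper name `weight_megabytes` are assumptions made for illustration, and activation memory, which depends on input resolution, is not counted.

```python
# Parameter count as stated in the abstract (2.37M).
PARAM_COUNT = 2_370_000

# Bytes needed to store one parameter at each precision.
# Which precision a deployment would use is an assumption, not stated in the abstract.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_megabytes(param_count: int, bytes_per_param: int) -> float:
    """Return the size of the stored weights in MB (1 MB = 10**6 bytes)."""
    return param_count * bytes_per_param / 1e6


# Weight storage per precision; excludes activations and runtime overhead.
footprint = {
    precision: weight_megabytes(PARAM_COUNT, width)
    for precision, width in BYTES_PER_PARAM.items()
}

for precision, megabytes in footprint.items():
    print(f"{precision}: {megabytes:.2f} MB")
```

Under these assumptions the weights occupy under 10 MB even at full 32-bit precision, which is consistent with the abstract's claim that the model suits edge devices.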