High-Order Semantic Decoupling Network for Remote Sensing Image Semantic Segmentation
Chengyu Zheng, Jie Nie, Zhaoxin Wang, Ning Song, Jingyu Wang, Zhiqiang Wei
DOI: https://doi.org/10.1109/tgrs.2023.3249230
IF: 8.2
2023-03-22
IEEE Transactions on Geoscience and Remote Sensing
Abstract: Low-order features extracted by convolution kernels are easily distorted by the dramatic view-angle transformations and atmospheric scattering found in remote sensing (RS) images. To address this concern, this article first proposes performing semantic segmentation of RS images on high-order information, which represents the relative relationships among low-order features and remains robust and stable under feature distortion. In addition, semantic decoupling has recently been well researched and has achieved significant improvements in image understanding. Thus, in this article, a high-order semantic decoupling network (HSDN) is proposed to disentangle features by semantics based on high-order features. Specifically, HSDN first represents each pixel by calculating the pixel-level affinity as a high-order feature and then clusters these pixels into different semantics. Afterward, an attention-like mask generation module is designed for both intra-semantic and inter-semantic groups, leading to three kinds of masks: the semantic decoupling mask (SDM), which uses each high-order cluster centroid as a mask to compact features within a cluster and separate different clusters, thereby improving semantic disentanglement; the semantic enhancement mask (SEM), which records pixel-level relative correlation within a class to fully exploit high-order features and enhance feature robustness; and the boundary supplementary mask (BSM), which processes borderline pixels to reduce clustering errors. Finally, by applying the masks to pixels both within classes and on borderlines, semantically decoupled features are generated and concatenated to realize segmentation. Quantitative and qualitative experiments are conducted on two large-scale fine-resolution RS image datasets to demonstrate the significant performance gained by adopting high-order representation. We also conduct numerous experiments to validate the effectiveness of the proposed semantic decoupling framework in dealing with complicated and distortion-prone RS image segmentation tasks.
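The pipeline outlined in the abstract (pixel-level affinity as a high-order feature, clustering into semantics, centroid-based masking, concatenation) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The cosine-similarity affinity, the plain k-means with farthest-point initialization, and the names `pixel_affinity`, `kmeans`, and `decouple` are all assumptions made for illustration. Only a centroid mask loosely analogous to the SDM is shown; the SEM and BSM are omitted.

```python
import numpy as np


def pixel_affinity(features):
    """Cosine affinity between every pair of pixels (an assumed affinity).

    features: (N, C) float array, one row of low-order features per pixel.
    Returns an (N, N) matrix; row i serves as the high-order descriptor
    of pixel i, i.e. its relation to all other pixels.
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)
    return unit @ unit.T


def _assign(points, centroids):
    # Index of the nearest centroid (squared Euclidean distance) per point.
    dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)


def kmeans(points, k, iters=10):
    """Plain k-means with deterministic farthest-point initialization."""
    chosen = [0]
    dist = ((points - points[0]) ** 2).sum(axis=1)
    for _ in range(1, k):
        nxt = int(dist.argmax())
        chosen.append(nxt)
        dist = np.minimum(dist, ((points - points[nxt]) ** 2).sum(axis=1))
    centroids = points[chosen].copy()

    for _ in range(iters):
        labels = _assign(points, centroids)
        for j in range(k):
            members = points[labels == j]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    return _assign(points, centroids), centroids


def decouple(features, k):
    """Cluster pixels on affinity rows and mask features per cluster.

    Each centroid lives in affinity space, so it is a length-N vector
    that can be read as a soft mask over pixels (illustrative only).
    Returns labels (N,), masks (k, N), and concatenated features (N, k*C).
    """
    affinity = pixel_affinity(features)
    labels, masks = kmeans(affinity, k)
    masked = [features * masks[j][:, None] for j in range(k)]
    return labels, masks, np.concatenate(masked, axis=1)
```

For an image feature map of shape (H, W, C), the input would be flattened to (H*W, C) first. The full affinity matrix grows quadratically with the number of pixels, so this sketch is only practical for small patches.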
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics