Feature-Selection High-Resolution Network With Hypersphere Embedding for Semantic Segmentation of VHR Remote Sensing Images
Hanwen Xu,Xinming Tang,Bo Ai,Fanlin Yang,Zhen Wen,Xiaomeng Yang
DOI: https://doi.org/10.1109/tgrs.2022.3183144
IF: 8.2
2022-06-29
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Very-high-resolution (VHR) remote sensing images contain various multiscale objects, such as large-scale buildings and small-scale cars. However, these multiscale objects cannot be considered simultaneously in the widely used backbones with a large downsampling factor (e.g., VGG-like and ResNet-like), resulting in the appearance of various context aggregation approaches, such as fusing low-level features and attention-based modules. To alleviate this problem caused by backbones with a large downsampling factor, we propose a feature-selection high-resolution network (FSHRNet) based on an observation: if the features maintain high resolution throughout the network, a high precision segmentation result can be obtained by only using a 1 convolution layer with no need for complex context aggregation modules. Specifically, the backbone of FSHRNet is a multibranch structure similar to HRNet where the high-resolution branch is the principal line. Then, a lightweight dynamic weight module, named the feature-selection convolution (FSConv) layer, is presented to fuse multiresolution features, allowing adaptively feature selection based on the characteristic of objects. Finally, a specially designed 1 convolution layer derived from hypersphere embedding is used to produce the segmentation result. Experiments with other widely used methods show that the proposed FSHRNet obtains competitive performance on the ISPRS Vaihingen dataset, the ISPRS Potsdam dataset, and the iSAID dataset.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics