ASNet: Adaptive Semantic Network Based on Transformer–CNN for Salient Object Detection in Optical Remote Sensing Images
Ruixiang Yan, Longquan Yan, Guohua Geng, Yufei Cao, Pengbo Zhou, Yongle Meng
DOI: https://doi.org/10.1109/tgrs.2024.3362836
IF: 8.2
2024-02-16
IEEE Transactions on Geoscience and Remote Sensing
Abstract: Salient object detection in optical remote sensing images (RSI-SOD) has recently become a key area of research, driven by the unique challenges posed by specific imaging conditions. Traditional approaches, largely based on convolutional neural networks (CNNs), are limited in handling the diverse scenarios of remote sensing because of their static network construction and reliance on local feature extraction. To address these limitations, we present the adaptive semantic network (ASNet), a framework designed specifically for RSI-SOD. ASNet integrates Transformer and CNN components in a dual-branch encoder that captures both global dependencies and local fine-grained image details. The network also includes three modules: an adaptive semantic matching module (ASMM) that dynamically harmonizes filter responses to global and local contexts; an adaptive feature enhancement module (AFEM) that enhances salient-region features while restoring image resolution; and a multiscale fine-grained inference module (MFIM) that refines high-level semantic features by integrating detailed low-level information, yielding precise, high-quality saliency maps. Together, these components adapt to the complex nature of remote sensing images (RSIs). Extensive experimental evaluations confirm that ASNet substantially outperforms existing models on the RSI-SOD task.
imaging science & photographic technology; remote sensing; engineering, electrical & electronic; geochemistry & geophysics