Bidirectional Mutual Guidance Transformer for Salient Object Detection in Optical Remote Sensing Images

Kan Huang,Chunwei Tian,Ge Li
DOI: https://doi.org/10.1080/01431161.2023.2229494
IF: 3.531
2023-01-01
International Journal of Remote Sensing
Abstract:Salient object detection in optical remote sensing images presents great challenges due to the characteristics of remote sensing images such as cluttered background, varying object scales, and unstable imaging conditions, etc. In this paper, we present a Bidirectional Mutual Guidance Transformer (BMGT), which mitigates the locality issue of CNN-based models, and exploits the mutual guidance between global context-aware object representations and fine-grained boundary structures. It contains a hierarchically structured Transformer encoder that extracts multi-level multi-scale token representations, and a dual-stream cross-task MLP decoder that performs joint salient object detection and salient boundary detection in an end-to-end manner. In particular, the dual-stream decoder consists of two sub-branch networks with symmetric architectures, that are connected by a newly proposed Mutual Guidance MLP layer (MG-MLP). Through MG-MLP, salient object features and salient boundary features interact with each other, facilitating complementary learning at multiple network levels. Extensive evaluations demonstrate that our proposed method outperforms other existing methods in two public remote sensing image benchmarks. It proves that our BMGT is advantageous in exploiting long-range context dependencies as well as preserving fine-grained boundary structures.
What problem does this paper attempt to address?