Abstract: Salient object detection (SOD) based on RGB images and depth maps (RGB-D) has been well studied in recent years, especially with deep neural networks. An RGB image provides rich local and semantic features, while the depth map provides global structural information. Many researchers have treated depth information as a supplement to RGB images. However, depth maps in various datasets are less precise than the RGB information because they are captured under varying conditions. How to thoroughly exploit these features at different levels therefore remains unresolved. Many cognitive theories, such as the topological perception theory, claim that global properties are prior to local ones and are important for human recognition. In this paper, we propose a novel global-prior-guided fusion network with global-prior extraction modules to fuse cross-modality features. Each module contains a cross-attention mechanism guided by deeper global priors, and the global prior extracted by the module is used to guide the processing of local features in shallow layers. The global-guided network first integrates the local and global cross features into the depth-map decoder, and the fused structural features of that decoder are then fused into the saliency decoder. Experimental results show that our method outperforms other state-of-the-art (SOTA) methods on the RGB-D SOD task across seven datasets (DUT-RGBD, NJUD, LFSD, NLPR, RGBD135, SIP, and STERE) in terms of most metrics. To further exploit the modules we designed, we extended our model to RGB and video SOD with slight adaptations and obtained results comparable to those of SOTA methods in both fields.
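
The idea of a cross attention guided by a deeper global prior can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `guided_cross_attention`, the choice to add the prior to the RGB queries, and the mean-pooled `new_prior` are all assumptions made purely for illustration, and the sketch omits learned projections entirely.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def guided_cross_attention(rgb_tokens, depth_tokens, global_prior):
    """Hypothetical sketch: RGB queries, shifted by a global prior, attend to depth tokens.

    rgb_tokens, depth_tokens: lists of equal-length feature vectors.
    global_prior: one vector of the same length (assumed to come from a deeper layer).
    Returns the fused tokens and a new prior (assumed here to be their mean).
    """
    d = len(global_prior)
    scale = math.sqrt(d)
    fused = []
    for q in rgb_tokens:
        # Assumption: guidance is injected by adding the prior to each query.
        guided_q = [qi + gi for qi, gi in zip(q, global_prior)]
        weights = softmax([dot(guided_q, k) / scale for k in depth_tokens])
        attended = [sum(w * v[j] for w, v in zip(weights, depth_tokens))
                    for j in range(d)]
        # Residual fusion of the RGB token with the attended depth features.
        fused.append([qi + ai for qi, ai in zip(q, attended)])
    # Assumption: the prior passed to shallower layers is the mean fused token.
    new_prior = [sum(tok[j] for tok in fused) / len(fused) for j in range(d)]
    return fused, new_prior
```

In this sketch the returned `new_prior` would be passed to the next, shallower stage as its `global_prior`, mirroring the deep-to-shallow guidance the abstract describes.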

LGCNet: A Local-to-global Context-Aware Feature Augmentation Network for Salient Object Detection

A Learning-Based Method Using Data Augmentation for Light Field Salient Object Detection

Dual-Branch Feature Fusion Network for Salient Object Detection

Global contextual guided residual attention network for salient object detection

Global Context Encoding For Salient Objects Detection

SAC-Net: Spatial Attenuation Context for Salient Object Detection

Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images

CEMINet: Context exploration and multi-level interaction network for salient object detection

Interactive Context-Aware Network for RGB-T Salient Object Detection

CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection

Complementary characteristics fusion network for weakly supervised salient object detection

Learning discriminative context for salient object detection

MFCINet: multi-level feature and context information fusion network for RGB-D salient object detection

Global-prior-guided fusion network for salient object detection

LARNet: Towards Lightweight, Accurate and Real-time Salient Object Detection

Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection

Multi-Level Context Aggregation Network with Channel-Wise Attention for Salient Object Detection

An adaptive guidance fusion network for RGB-D salient object detection

Global Perception Network for Salient Object Detection in Remote Sensing Images