Abstract:Recent progress in salient object detection (SOD) mainly depends on dilated convolution with different receptive fields to capture contextual information for multi-scale learning. Intuitively, contextual information in different scales is conducive to understanding the image content, and thus can help us identify and locate salient objects in real-world scenes. However, the sparsity inside the dilated convolution kernel may cause the problem of local information loss, limiting the predictive accuracy of the model. In addition, the inequality of feature channels should also be considered, and they often feature different deviations for salient objects or background noises. Although some channel attention mechanisms have been proposed in SOD, their ability to capture global information is limited, and the problem of high complexity is still a great challenge. To alleviate the abovementioned problems, we propose a Related Context-Driven Network (RCNet) with Hierarchical Attention for Salient Object Detection, consisting of a cascaded multi-scale context exploration (CMCE) module and a hierarchical feature aggregation (HFA) module. The CMCE module is to capture multi-scale contextual information through using multi-receptive-field dilated convolutions in a diamond hierarchical structure, where a feature reconstruction operation is deployed to improve the correlation of features, effectively avoiding the gridding problems and local information loss. Meanwhile, the HFA module adaptively interacts with the complementary information of the multi-level features to further capture the important information from within the feature channel by a multi-source hybrid channel attention (MHCA) mechanism to generate powerful and robust feature representations. Extensive experiments on six benchmark datasets demonstrate that the proposed RCNet method consistently outperforms 20 existing the state-of-the-art SOD methods in terms of accuracy, generalization capacity and robustness.

Rcnet: Related Context-Driven Network with Hierarchical Attention for Salient Object Detection

CEMINet: Context exploration and multi-level interaction network for salient object detection

Emcenet: Efficient Multi-Scale Context Exploration Network for Salient Object Detection

Residual Dense Collaborative Network for Salient Object Detection

Compensated Attention Feature Fusion and Hierarchical Multiplication Decoder Network for RGB-D Salient Object Detection

Deep Feature Filtering and Contextual Information Gathering Network for RGB-D Salient Object Detection

Interactive Context-Aware Network for RGB-T Salient Object Detection

Deep Salient Object Detection Via Hierarchical Network Learning

LGCNet: A Local-to-global Context-Aware Feature Augmentation Network for Salient Object Detection

Cross-modal refined adjacent-guided network for RGB-D salient object detection

CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection

Global contextual guided residual attention network for salient object detection

RRNet: Relational Reasoning Network with Parallel Multi-scale Attention for Salient Object Detection in Optical Remote Sensing Images

HDNet: Multi-Modality Hierarchy-Aware Decision Network for RGB-D Salient Object Detection

Cross-modal and Cross-level Attention Interaction Network for Salient Object Detection

MFCINet: multi-level feature and context information fusion network for RGB-D salient object detection

Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection

Collaborative Content-Dependent Modeling: A Return to the Roots of Salient Object Detection.

RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images

Dynamic Selective Network for RGB-D Salient Object Detection

Contextual Attention Enhanced Network for Salient Object Detection in Optical Remote Sensing Images