Abstract: Recent progress in salient object detection (SOD) mainly depends on dilated convolutions with different receptive fields to capture contextual information for multi-scale learning. Intuitively, contextual information at different scales is conducive to understanding image content, and thus helps to identify and locate salient objects in real-world scenes. However, the sparsity of the dilated convolution kernel may cause local information loss, limiting the predictive accuracy of the model. In addition, the unequal importance of feature channels should be considered, since channels often respond differently to salient objects and to background noise. Although some channel attention mechanisms have been proposed for SOD, their ability to capture global information is limited, and their high complexity remains a challenge. To alleviate these problems, we propose a Related Context-Driven Network (RCNet) with Hierarchical Attention for Salient Object Detection, consisting of a cascaded multi-scale context exploration (CMCE) module and a hierarchical feature aggregation (HFA) module. The CMCE module captures multi-scale contextual information using multi-receptive-field dilated convolutions arranged in a diamond hierarchical structure, where a feature reconstruction operation is deployed to improve the correlation of features, effectively avoiding the gridding problem and local information loss. Meanwhile, the HFA module adaptively exchanges complementary information among multi-level features and uses a multi-source hybrid channel attention (MHCA) mechanism to capture the important information within feature channels, generating powerful and robust feature representations. Extensive experiments on six benchmark datasets demonstrate that the proposed RCNet consistently outperforms 20 existing state-of-the-art SOD methods in terms of accuracy, generalization capacity, and robustness.
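
The gridding problem mentioned above can be illustrated with a toy 1-D example. The sketch below is not the CMCE module or any part of RCNet; it only counts which input offsets can influence one output position when 3-tap dilated kernels are stacked. All function names are hypothetical and chosen for illustration.

```python
def sampled_offsets(kernel_size, dilation):
    """Input offsets (relative to the output position) read by one 1-D dilated kernel."""
    half = kernel_size // 2
    return [dilation * k for k in range(-half, half + 1)]


def stacked_coverage(kernel_size, dilations):
    """Set of input offsets that can influence one output after stacking dilated layers."""
    reached = {0}
    for d in dilations:
        taps = sampled_offsets(kernel_size, d)
        # Each already-reachable offset fans out through every tap of the next layer.
        reached = {r + t for r in reached for t in taps}
    return reached


def missing_offsets(kernel_size, dilations):
    """Offsets inside the receptive field that are never sampled (the 'grid holes')."""
    covered = stacked_coverage(kernel_size, dilations)
    lo, hi = min(covered), max(covered)
    return [o for o in range(lo, hi + 1) if o not in covered]


if __name__ == "__main__":
    # Same dilation twice: odd offsets are skipped, so local information is lost.
    print(missing_offsets(3, [2, 2]))
    # Mixed dilations: the receptive field is covered without holes.
    print(missing_offsets(3, [1, 2]))
```

In this toy setting, repeating the same dilation rate leaves holes in the receptive field, while combining different rates fills them, which is the intuition behind using multiple receptive fields together.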
