Abstract:Recent progress in salient object detection (SOD) mainly depends on dilated convolution with different receptive fields to capture contextual information for multi-scale learning. Intuitively, contextual information in different scales is conducive to understanding the image content, and thus can help us identify and locate salient objects in real-world scenes. However, the sparsity inside the dilated convolution kernel may cause the problem of local information loss, limiting the predictive accuracy of the model. In addition, the inequality of feature channels should also be considered, and they often feature different deviations for salient objects or background noises. Although some channel attention mechanisms have been proposed in SOD, their ability to capture global information is limited, and the problem of high complexity is still a great challenge. To alleviate the abovementioned problems, we propose a Related Context-Driven Network (RCNet) with Hierarchical Attention for Salient Object Detection, consisting of a cascaded multi-scale context exploration (CMCE) module and a hierarchical feature aggregation (HFA) module. The CMCE module is to capture multi-scale contextual information through using multi-receptive-field dilated convolutions in a diamond hierarchical structure, where a feature reconstruction operation is deployed to improve the correlation of features, effectively avoiding the gridding problems and local information loss. Meanwhile, the HFA module adaptively interacts with the complementary information of the multi-level features to further capture the important information from within the feature channel by a multi-source hybrid channel attention (MHCA) mechanism to generate powerful and robust feature representations. Extensive experiments on six benchmark datasets demonstrate that the proposed RCNet method consistently outperforms 20 existing the state-of-the-art SOD methods in terms of accuracy, generalization capacity and robustness.

Recurrent Attentional Networks for Saliency Detection

Accurate salient object detection via dense recurrent connections and residual-based hierarchical feature integration.

Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection.

Saliency Detection with Recurrent Fully Convolutional Networks

Salient Object Detection with Recurrent Fully Convolutional Networks.

Self-Attention Recurrent Network for Saliency Detection

Progressive Attention Guided Recurrent Network for Salient Object Detection

R³Net: Recurrent Residual Refinement Network for Saliency Detection

Salient Object Detection Via Recurrently Aggregating Spatial Attention Weighted Cross-Level Deep Features

Reverse Attention Based Residual Network for Salient Object Detection.

Salient Object Detection Via Multi-Scale Attention CNN

Saliency Detection With a Three-Stage Hierarchical Network

Depth-induced Multi-scale Recurrent Attention Network for Saliency Detection

Contour-Aware Recurrent Cross Constraint Network for Salient Object Detection

A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection

A ConvLSTM-Combined Hierarchical Attention Network for Saliency Detection.

Multi-level and multi-scale deep saliency network for salient object detection

Recurrent Double Features: Recurrent Multi-scale Deep Features and Saliency Features for Salient Object Detection

Rcnet: Related Context-Driven Network with Hierarchical Attention for Salient Object Detection

Deep Sub-Region Network for Salient Object Detection

AWANet: Attentive-Aware Wide-Kernels Asymmetrical Network with Blended Contour Information for Salient Object Detection