Abstract: Although deep learning-based techniques for salient object detection have improved considerably in recent years, estimated saliency maps still exhibit imprecise predictions owing to the internal complexity and indefinite boundaries of salient objects of varying sizes. Existing methods emphasize the design of an exemplary structure that integrates multi-level features, employing multi-scale features and attention modules to filter salient regions from cluttered scenes. We propose a saliency detection network based on three novel contributions. First, we use a dense feature extraction unit (DFEU) that introduces large-kernel asymmetric and group-wise convolutions with channel reshuffling. The DFEU extracts semantically enriched features with large receptive fields and reduces the gridding problem and the parameter size for subsequent operations. Second, we propose a cross-feature integration unit (CFIU) that extracts semantically enriched features at high resolution using dense short connections and sub-samples the integrated information into different attentional branches based on the inputs received at each stage of the backbone. The embedded independent attentional branches capture the importance of the sub-regions of a salient object. With the constraint-wise growth of the sub-attentional branches at various stages, the CFIU can efficiently avoid global and local feature dilution by extracting semantically enriched features via dense short connections from high and low levels. Finally, we devise a contour-aware saliency refinement unit (CSRU) that blends contour and contextual features in a progressive, densely connected fashion to help the model obtain more accurate saliency maps with precise boundaries in complex and perplexing scenarios. Our proposed model was evaluated with ResNet-50 and VGG-16 backbones and outperforms most contemporary techniques with fewer parameters.
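As a rough illustration of why the DFEU's ingredients can reduce parameter size, the sketch below counts the weights of a full k x k convolution against an asymmetric (k x 1 followed by 1 x k) group-wise pair, and shows a standard ShuffleNet-style channel shuffle on channel indices. This is not the authors' implementation: the function names, the 64-channel width, the kernel size of 7, and the group count of 4 are all assumptions chosen only for the example, and biases are ignored.

```python
def conv_params(c_in, c_out, kh, kw, groups=1):
    """Weight count of a 2-D convolution with the given groups (bias ignored)."""
    assert c_in % groups == 0 and c_out % groups == 0
    return (c_in // groups) * c_out * kh * kw


def asymmetric_params(c_in, c_out, k, groups=1):
    """Weight count of a k x 1 convolution followed by a 1 x k convolution."""
    return (conv_params(c_in, c_out, k, 1, groups)
            + conv_params(c_out, c_out, 1, k, groups))


def channel_shuffle(channels, groups):
    """Interleave channels across groups: view as (groups, per_group),
    transpose to (per_group, groups), then flatten."""
    n = len(channels)
    assert n % groups == 0
    per_group = n // groups
    return [channels[g * per_group + i]
            for i in range(per_group)
            for g in range(groups)]


# Hypothetical sizes, chosen only for illustration.
C, K, G = 64, 7, 4
full = conv_params(C, C, K, K)               # one dense 7 x 7 convolution
asym = asymmetric_params(C, C, K)            # 7 x 1 then 1 x 7, no grouping
asym_grouped = asymmetric_params(C, C, K, G) # same pair, split into 4 groups

shuffled = channel_shuffle(list(range(8)), groups=2)
```

With these assumed sizes, the asymmetric pair needs 2k weights per input-output channel pair instead of k squared, and grouping divides that count by the number of groups. The shuffle then lets information cross group boundaries, since each group in the next layer receives channels from every group of the previous one.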

U2-Net: Going deeper with nested U-structure for salient object detection

Dual-Branch Feature Fusion Network for Salient Object Detection

Salient Object Detection Based on Visual Perceptual Saturation and Two-Stream Hybrid Networks

Rethinking Two-B-Real Net for Real-Time Salient Object Detection

Two-B-Real Net: Two-Branch Network For Real-Time Salient Object Detection

Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection

EDN: Salient Object Detection via Extremely-Downsampled Network

U2-ONet: A Two-level Nested Octave U-structure with Multiscale Attention Mechanism for Moving Instances Segmentation

Nested Network With Two-Stream Pyramid for Salient Object Detection in Optical Remote Sensing Images

M$^3$Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection

Richer and Deeper Supervision Network for Salient Object Detection

BASNet: Boundary-Aware Salient Object Detection

Boosting Salient Object Detection with Transformer-based Asymmetric Bilateral U-Net

UDNet: Uncertainty-aware Deep Network for Salient Object Detection

DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection

U2-ONet: A Two-Level Nested Octave U-Structure Network with a Multi-Scale Attention Mechanism for Moving Object Segmentation

SAMNet: Stereoscopically Attentive Multi-Scale Network for Lightweight Salient Object Detection

A Unified Structure for Efficient RGB and RGB-D Salient Object Detection

QCNet: Query Context Network for Salient Object Detection of Automatic Surface Inspection

Densely Nested Top-Down Flows for Salient Object Detection

AWANet: Attentive-Aware Wide-Kernels Asymmetrical Network with Blended Contour Information for Salient Object Detection