Abstract:Although deep learning-based techniques for salient object detection have considerably improved over recent years, estimated saliency maps still exhibit imprecise predictions owing to the internal complexity and indefinite boundaries of salient objects of varying sizes. Existing methods emphasize the design of an exemplary structure to integrate multi-level features by employing multi-scale features and attention modules to filter salient regions from cluttered scenarios. We propose a saliency detection network based on three novel contributions. First, we use a dense feature extraction unit (DFEU) by introducing large kernels of asymmetric and grouped-wise convolutions with channel reshuffling. The DFEU extracts semantically enriched features with large receptive fields and reduces the gridding problem and parameter sizes for subsequent operations. Second, we suggest a cross-feature integration unit (CFIU) that extracts semantically enriched features from their high resolutions using dense short connections and sub-samples the integrated information into different attentional branches based on the inputs received for each stage of the backbone. The embedded independent attentional branches can observe the importance of the sub-regions for a salient object. With the constraint-wise growth of the sub-attentional branches at various stages, the CFIU can efficiently avoid global and local feature dilution effects by extracting semantically enriched features via dense short-connections from high and low levels. Finally, a contour-aware saliency refinement unit (CSRU) was devised by blending the contour and contextual features in a progressive dense connected fashion to assist the model toward obtaining more accurate saliency maps with precise boundaries in complex and perplexing scenarios. Our proposed model was analyzed with ResNet-50 and VGG-16 and outperforms most contemporary techniques with fewer parameters.

Foreground Gating and Background Refining Network for Surveillance Object Detection

Background-aware Siamese Network Tracking Based on Salient Feature Fusion

FRBNet: feature-iterative reinforcement and boundary-directed network for camouflaged object detection

High Efficient Moving Object Extraction and Classification in Traffic Video Surveillance

Foreground Detection in Surveillance Video with Fully Convolutional Semantic Network

Real-Time One-Stream Semantic-Guided Refinement Network for RGB-Thermal Salient Object Detection

A Novel Framework to Generate Synthetic Video for Foreground Detection in Highway Surveillance Scenarios

Real-time salient object detection with boundary information guidance

FII-CenterNet: An Anchor-Free Detector With Foreground Attention for Traffic Object Detection

Improved Post-Processing for Human Detection in Railroad Surveillance

Efficient Context-Guided Stacked Refinement Network for RGB-T Salient Object Detection

Gated forward refinement network for action segmentation

Foreground separation knowledge distillation for object detection

LiDAR-based 3D Video Object Detection with Foreground Context Modeling and Spatiotemporal Graph Reasoning

AWANet: Attentive-Aware Wide-Kernels Asymmetrical Network with Blended Contour Information for Salient Object Detection

FI-WSOD: Foreground Information Guided Weakly Supervised Object Detection

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

PRNet: Parallel Refinement Network With Group Feature Learning for Salient Object Detection in Optical Remote Sensing Images

Hierarchical and Interactive Refinement Network for Edge-Preserving Salient Object Detection

Suppress-and-Refine Framework for End-to-End 3D Object Detection

Lightweight Real-Time Object Detection via Enhanced Global Perception and Intra-Layer Interaction for Complex Traffic Scenarios