Abstract:Although deep learning-based techniques for salient object detection have considerably improved over recent years, estimated saliency maps still exhibit imprecise predictions owing to the internal complexity and indefinite boundaries of salient objects of varying sizes. Existing methods emphasize the design of an exemplary structure to integrate multi-level features by employing multi-scale features and attention modules to filter salient regions from cluttered scenarios. We propose a saliency detection network based on three novel contributions. First, we use a dense feature extraction unit (DFEU) by introducing large kernels of asymmetric and grouped-wise convolutions with channel reshuffling. The DFEU extracts semantically enriched features with large receptive fields and reduces the gridding problem and parameter sizes for subsequent operations. Second, we suggest a cross-feature integration unit (CFIU) that extracts semantically enriched features from their high resolutions using dense short connections and sub-samples the integrated information into different attentional branches based on the inputs received for each stage of the backbone. The embedded independent attentional branches can observe the importance of the sub-regions for a salient object. With the constraint-wise growth of the sub-attentional branches at various stages, the CFIU can efficiently avoid global and local feature dilution effects by extracting semantically enriched features via dense short-connections from high and low levels. Finally, a contour-aware saliency refinement unit (CSRU) was devised by blending the contour and contextual features in a progressive dense connected fashion to assist the model toward obtaining more accurate saliency maps with precise boundaries in complex and perplexing scenarios. Our proposed model was analyzed with ResNet-50 and VGG-16 and outperforms most contemporary techniques with fewer parameters.

ABSNet: Aesthetics-Based Saliency Network Using Multi-Task Convolutional Network

TSMSAN: A Three-Stream Multi-Scale Attentive Network for Video Saliency Detection.

Specificity-preserving RGB-D Saliency Detection

DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection

Image Aesthetic Assessment Using a Saliency Symbiosis Network.

Towards accurate RGB-D saliency detection with complementary attention and adaptive integration

Bi-attention Network for Bi-Directional Salient Object Detection

A Novel Deep Network and Aggregation Model for Saliency Detection

Attentive and Context-Aware Deep Network for Saliency Prediction on Omni-Directional Images

MMMNet: An End-to-End Multi-Task Deep Convolution Neural Network With Multi-Scale and Multi-Hierarchy Fusion for Blind Image Quality Assessment

MSCAN: Multimodal Self-and-Collaborative Attention Network for Image Aesthetic Prediction Tasks

AWANet: Attentive-Aware Wide-Kernels Asymmetrical Network with Blended Contour Information for Salient Object Detection

Attention-aware concentrated network for saliency prediction

Multi-Task Convolutional Neural Network for Image Aesthetic Assessment

M+Mnet: A Multibranch Network with Mixed Precision Training for Image Aesthetics Assessment

JOINT LEARNING OF IMAGE AESTHETIC QUALITY ASSESSMENT AND SEMANTIC RECOGNITION BASED ON FEATURE ENHANCEMENT

Beyond Vision: A Multimodal Recurrent Attention Convolutional Neural Network for Unified Image Aesthetic Prediction Tasks

An End-to-End Network for Co-Saliency Detection in One Single Image

Deep Multi-Level Networks with Multi-Task Learning for Saliency Detection

Ada-Sal Network: Emulate the Human Visual System

Semantic and Contrast-Aware Saliency