Abstract: Saliency detection has attracted growing research interest in recent years, since many computer vision applications need to derive object attention from images as a first step. Multi-scale awareness becomes essential for a saliency detector to find thin and small attention regions while preserving high-level semantics. In this paper, we propose a novel holistic and deep feature pyramid neural network architecture that leverages multi-scale semantics in both the feature encoding stage and the saliency region prediction (decoding) stage. In the encoding stage, we exploit the multi-scale, pyramidal hierarchy of feature maps via a densely connected network with variable-size dilated convolutions as well as pyramid pooling. In the decoding stage, we fuse multi-level feature maps via up-sampling and convolution. In addition, we apply multi-level deep supervision by plugging in loss functions at every feature fusion level. Multi-loss supervision regularizes the weight search space across the different tasks, which minimizes overfitting, and enhances the gradient signal during backpropagation, and thus enables us to train the network from scratch. This architecture builds inherent multi-level semantic pyramidal feature maps at different scales and enhances the model's capability in the saliency detection task. We validated our approach on six benchmark datasets and compared it with eleven state-of-the-art methods. The results demonstrate the effectiveness of the design and show that our approach outperforms the compared methods.

Corresponding authors: Zhifan Gao (gaozhifan@gmail.com) and Heye Zhang (hy.zhang@siat.ac.cn).
Funding: National Natural Science Foundation of China (No. 61525106, 61427807, 61771464); Shenzhen innovation funding (JCYJ20170307165309009, JCYJ20170413114916687, SGLH20161212104605195).
© 2018. The copyright of this document resides with its authors.
29th British Machine Vision Conference (BMVC 2018).
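The multi-level deep supervision mentioned in the abstract (a loss attached at every feature fusion level) can be illustrated with a small sketch. This is not the authors' implementation: the function names, the choice of binary cross-entropy as the per-level loss, nearest-neighbour up-sampling, and equal level weights are all assumptions made for illustration, since the abstract does not specify them. Plain Python lists stand in for saliency maps so the example has no dependencies.

```python
import math

def upsample_nearest(pred, out_h, out_w):
    """Nearest-neighbour up-sampling of a 2D map (list of rows) to out_h x out_w."""
    in_h, in_w = len(pred), len(pred[0])
    return [
        [pred[i * in_h // out_h][j * in_w // out_w] for j in range(out_w)]
        for i in range(out_h)
    ]

def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between two 2D maps of equal size."""
    total, count = 0.0, 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
            total -= t * math.log(p) + (1.0 - t) * math.log(1.0 - p)
            count += 1
    return total / count

def deep_supervision_loss(side_outputs, target, weights=None):
    """Weighted sum of per-level losses (hypothetical helper).

    Each side output is a coarse saliency map predicted at one fusion level;
    it is up-sampled to the ground-truth resolution before its loss is taken.
    Returns the total loss and the list of per-level losses.
    """
    h, w = len(target), len(target[0])
    if weights is None:
        weights = [1.0] * len(side_outputs)  # assumption: equal weighting
    losses = [bce(upsample_nearest(s, h, w), target) for s in side_outputs]
    total = sum(wt * loss for wt, loss in zip(weights, losses))
    return total, losses

# Toy ground truth: a salient 2x2 block in the top-left corner of a 4x4 image.
target = [
    [1.0, 1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
# Three made-up side outputs at resolutions 1x1, 2x2 and 4x4.
side_outputs = [
    [[0.5]],
    [[0.9, 0.1], [0.1, 0.1]],
    [[0.9, 0.9, 0.1, 0.1], [0.9, 0.9, 0.1, 0.1],
     [0.1, 0.1, 0.1, 0.1], [0.1, 0.1, 0.1, 0.1]],
]
total_loss, level_losses = deep_supervision_loss(side_outputs, target)
```

Because every level contributes its own term to the total, each level receives a direct gradient signal in a real network, which is the effect the abstract attributes to multi-loss supervision. In a trained model the side outputs would come from the decoder's fusion stages rather than from hand-written lists.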

Shallow and Deep Convolutional Networks for Saliency Prediction

Saliency Prediction Based on New Deep Multi-Layer Convolution Neural Network

A Multiscale Dilated Dense Convolutional Network for Saliency Prediction with Instance-Level Attention Competition

Saliency Prediction on Omnidirectional Images with Brain-Like Shallow Neural Network

Holistic and Deep Feature Pyramids for Saliency Detection

Efficient Saliency Detection Using Convolutional Neural Networks with Feature Selection

Predicting Visual Saliency Via A Dilated Inception Module-Based Model

Deep Edge-Aware Saliency Detection

Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

Saliency Detection Within a Deep Convolutional Architecture

Visual Saliency Detection Based on Multiscale Deep CNN Features

Beyond Saliency: Understanding Convolutional Neural Networks from Saliency Prediction on Layer-Wise Relevance Propagation

Deep supervised visual saliency model addressing low-level features

Deep Visual Attention Prediction

Integrated Deep and Shallow Networks for Salient Object Detection

A Fast and Compact Saliency Score Regression Network Based on Fully Convolutional Network

Image Salient Object Detection with Refined Deep Features Via Convolution Neural Network

A Novel Approach to Reconstruction Based Saliency Detection Via Convolutional Neural Network Stacked with Auto-Encoder

A Dilated Inception Network for Visual Saliency Prediction

Multi-level and multi-scale deep saliency network for salient object detection

Deep Networks for Saliency Detection Via Local Estimation and Global Search