Abstract:With the goal of identifying pixel-wise salient object regions from each input image, salient object detection (SOD) has been receiving great attention in recent years. One kind of mainstream SOD methods is formed by a bottom-up feature encoding procedure and a top-down information decoding procedure. While numerous approaches have explored the bottom-up feature extraction for this task, the design on top-down flows still remains under-studied. To this end, this paper revisits the role of top-down modeling in salient object detection and designs a novel densely nested top-down flows (DNTDF)-based framework. In every stage of DNTDF, features from higher levels are read in via the progressive compression shortcut paths (PCSP). The notable characteristics of our proposed method are as follows. 1) The propagation of high-level features which usually have relatively strong semantic information is enhanced in the decoding procedure; 2) With the help of PCSP, the gradient vanishing issues caused by non-linear operations in top-down information flows can be alleviated; 3) Thanks to the full exploration of high-level features, the decoding process of our method is relatively memory efficient compared against those of existing methods. Integrating DNTDF with EfficientNet, we construct a highly light-weighted SOD model, with very low computational complexity. To demonstrate the effectiveness of the proposed model, comprehensive experiments are conducted on six widely-used benchmark datasets. The comparisons to the most state-of-the-art methods as well as the carefully-designed baseline models verify our insights on the top-down flow modeling for SOD. The code of this paper is available at <a class="link-external link-https" href="https://github.com/new-stone-object/DNTD" rel="external noopener nofollow">this https URL</a>.

Action-Driven Object Detection with Top-Down Visual Attentions

Top-down attention guided object detection

Biologically Motivated Object Detection Based On Integration Of Top-Down And Bottom-Up Attention

EBiDA-FPN: Enhanced Bi-Directional Attention Feature Pyramid Network for Object Detection

Object detection based on an adaptive attention mechanism

MLA-Net: Feature Pyramid Network with Multi-Level Local Attention for Object Detection

Object Detection for Remote Sensing Image Based on Top-Down Visual Attention

TDAF: Top-Down Attention Framework for Vision Tasks

A Visual Attention Based Object Detection Model beyond Top-Down and Bottom-up Mechanism

Efficient Object Detection Based on Selective Attention

Class Semantics-based Attention for Action Detection

Pay Attention to Them: Deep Reinforcement Learning-Based Cascade Object Detection.

Integration of Bottom-Up and Top-Down Cues in Bayesian Network for Object Detection

A visual attention model combining top-down and bottom-up mechanisms for salient object detection

Rethinking Attentive Object Detection Via Neural Attention Learning.

Attentive Contexts for Object Detection

ASAN: Self-Attending and Semantic Activating Network Towards Better Object Detection

Densely Nested Top-Down Flows for Salient Object Detection

Object Detection with Class Aware Region Proposal Network and Focused Attention Objective

A Novel Approach for Visual Saliency Detection and Segmentation Based on Objectness and Top-Down Attention

Learning Fixation Point Strategy for Object Detection and Classification