Abstract:Recent advances in deep convolution neural networks (CNNs) boost the development of video salient object detection (SOD), and many remarkable deep-CNNs video SOD models have been proposed. However, many existing deep-CNNs video SOD models still suffer from coarse boundaries of the salient object, which may be attributed to the loss of high-frequency information. The traditional graph-based video SOD models can preserve object boundaries well by conducting superpixels/supervoxels segmentation in advance, but they perform weaker in highlighting the whole object than the latest deep-CNNs models, limited by heuristic graph clustering algorithms. To tackle this problem, we find a new way to address this issue under the framework of graph convolution networks (GCNs), taking advantage of graph model and deep neural network. Specifically, a superpixel-level spatiotemporal graph is first constructed among multiple frame-pairs by exploiting the motion cues implied in the frame-pairs. Then the graph data is imported into the devised multi-stream attention-aware GCN, where a novel Edge-Gated graph convolution (GC) operation is proposed to boost the saliency information aggregation on the graph data. A novel attention module is designed to encode the spatiotemporal sematic information via adaptive selection of graph nodes and fusion of the static-specific and the motion-specific graph embedding. Finally, a smoothness-aware regularization term is proposed to enhance the uniformity of salient object. Graph nodes (superpixels) inherently belonging to the same class will be ideally clustered together in the learned embedding space. Extensive experiments have been conducted on three widely used datasets. Compared with fourteen state-of-the-art video SOD models, our proposed method can well retain the salient object boundaries and possess a strong learning ability, which shows that this work is a good practice for designing GCNs for video SOD.

Multi-scale graph feature extraction network for panoramic image saliency detection

Monocular Depth Estimation Based on Multi-Scale Graph Convolution Networks

Multi-scale graph reasoning network for remote sensing image change detection

Holistic and Deep Feature Pyramids for Saliency Detection.

Multiscale Attention Fusion Graph Network for Remote Sensing Building Change Detection

Enriched Feature Representation and Combination for Deep Saliency Detection

Multi-Stream Attention-Aware Graph Convolution Network for Video Salient Object Detection

Salient Object Detection Via Multi-Scale Neural Network.

A Saliency Enhanced Feature Fusion based multiscale RGB-D Salient Object Detection Network

DeepSaliency : MultiTask Deep Neural Network Model for Salient Object Detection

Non-Local Similarity-Based Attentive Graph Convolution Network for Remote Sensing Image Super-Resolution

Context-aware Graph Label Propagation Network for Saliency Detection.

Multi-Scale Feature Enhancement for Saliency Object Detection Algorithm

Activity guided multi-scales collaboration based on scaled-CNN for saliency prediction

Depth Scale Balance Saliency Detection with Connective Feature Pyramid and Edge Guidance.

AWANet: Attentive-Aware Wide-Kernels Asymmetrical Network with Blended Contour Information for Salient Object Detection

Multiscale Global Attention Network With Edge Perceptron for Automatic Road Extraction From Remote Sensing Imagery

DCTNET: HYBRID NETWORK MODEL FUSING WITH MULTISCALE DEFORMABLE CNN AND TRANSFORMER STRUCTURE FOR ROAD EXTRACTION FROM GAOFEN SATELLITE REMOTE SENSING IMAGE

Multi-scale Feature Aggregation Network for Salient Object Detection in Optical Remote Sensing Images

Global and Multiscale Aggregate Network for Saliency Object Detection in Optical Remote Sensing Images

Cascaded panoptic segmentation method for high resolution remote sensing image