Effective full-scale detection for salient object based on condensing-and-filtering network

Xinyu Yan,Meijun Sun,Yahong Han,Zheng Wang,Qi Tian
DOI: https://doi.org/10.1016/j.patcog.2022.108904
IF: 8
2022-01-01
Pattern Recognition
Abstract:With the development of deep learning, salient object detection methods have made great progress. How-ever, there are still two challenges: 1) The lack of rich features extracted from multiple perspectives at different encoder levels results in the omission of salient objects with varying scales. 2) The ineffective fusion of multi-level features during decoding dilutes the saliency features, which destroys the purity of the predicted maps. In this paper, we design a Condensing-and-Filtering Network (CFNet), in which a saliency pyramid condensing module (SPCM) and a saliency filtering module (SFM) are proposed to solve the above two problems respectively. Specifically, SPCM introduces pyramid convolution as the basic unit to condense full-scale features from global and local perspectives at each level of the encoder. SFM is equipped with an ingenious 'funnel' structure to effectively filter multi-level features and supplement de-tails, which makes the fusion of features more robust. The two modules complement each other, so that the full-scale features can be used effectively to predict salient objects. Extensive experimental results on five benchmark datasets demonstrate that our method performs favourably against the state-of-the-art approaches, and also shows superiority in terms of speed (16.18ms) and FLOPs (21.19G). Meanwhile, we extend our CFNet to the task of RGB-D salient object detection and achieve better results, which further demonstrate its effectiveness. The code will be made available.(c) 2022 Published by Elsevier Ltd.
What problem does this paper attempt to address?