RGB-D Salient Object Detection Based on Feature Extraction and Integration Convolutional Capsule Network

Kun Xu, Jun Guo
DOI: https://doi.org/10.21203/rs.3.rs-3177274/v1
2023-01-01
Abstract: Fully Convolutional Neural Networks (FCNs) have shown advantages in salient object detection (SOD) on RGB and RGB-D images. However, they face an object-part dilemma: most FCNs inevitably produce an incomplete segmentation of the salient object. Although the Capsule Network (CapsNet) can recognize a complete object and thus performs better, it is computationally demanding and time-consuming. In this paper, we propose a novel Feature Extraction and Integration Convolutional Capsule Network (FEICaps), based on the Convolutional Capsule Network, that handles the object-part relationship at lower computational cost. First, RGB features are extracted and integrated using a VGG backbone and a feature extraction module. Then, these features are integrated with depth images by the Feature Depth Module (FDM) and upsampled progressively to produce a feature map. Next, the feature map is fed into the Feature-integrated Convolutional Capsule Network (FiCaps) to explore the object-part relationship. FiCaps extracts object-part features using convolutional capsules with locally-connected routing and predicts the final saliency map using deconvolutional capsules. Experimental results on four RGB-D benchmark datasets show that the proposed method outperforms 23 state-of-the-art algorithms.
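To make the routing step concrete, the following is a minimal NumPy sketch of generic routing-by-agreement applied to the child capsules inside a single local window, which is the usual idea behind locally-connected routing. It is an illustration under stated assumptions, not the FiCaps implementation: the abstract does not specify the routing rule, and the function names `squash` and `route`, the three iterations, and the tensor shapes are all hypothetical choices.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    """Capsule squash: keeps the direction, maps the vector length into [0, 1)."""
    sq_norm = np.sum(s * s, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def route(votes, num_iters=3):
    """Routing-by-agreement over one local window (assumed, generic form).

    votes: array of shape (num_children, num_parents, dim), the prediction
           each child capsule in the window makes for each parent capsule.
    Returns parent capsules of shape (num_parents, dim) and the coupling
    coefficients of shape (num_children, num_parents) used to form them.
    """
    num_children, num_parents, _ = votes.shape
    logits = np.zeros((num_children, num_parents))
    for _ in range(num_iters):
        # Softmax over parents: each child distributes its output.
        e = np.exp(logits - logits.max(axis=1, keepdims=True))
        coupling = e / e.sum(axis=1, keepdims=True)
        # Weighted sum of votes per parent, then squash.
        parents = squash(np.einsum("cp,cpd->pd", coupling, votes))
        # Raise the logits where a child's vote agrees with the parent output.
        logits = logits + np.einsum("cpd,pd->cp", votes, parents)
    return parents, coupling
```

For example, a 3x3 window of child capsules voting for 4 parent capsules of dimension 8 would be passed as `votes` with shape `(9, 4, 8)`. Restricting the children to a small window, rather than routing every child to every parent across the image, is what keeps the cost of this step bounded.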