Bidirectional Cross-Selective Attention Network for Video Salient Object Detection

Jun Zhang, Biao Zhu, Peng Zhang, Ruijian Cheng, Yuzhen Shen
DOI: https://doi.org/10.2139/ssrn.4327757
2023-01-01
Abstract: Video salient object detection networks built on the appearance and optical flow framework suffer from several disturbing factors in the detection process: moving backgrounds are difficult to distinguish from salient targets, noise information is excessive, and detail features are blurred, all of which reduce the accuracy of the algorithm. To eliminate these factors, we propose a bidirectional cross-complementary solution for cross-domain features, in which video salient object detection is understood as a dynamic bidirectional interactive fusion process of "motion->appearance" and "appearance->motion". Based on this idea, we design BCSANet on an encoder-decoder architecture: a Bidirectional Cross Attention Module realizes the bidirectional interaction of cross-domain features during feature encoding, and a Selective Dual Attention Fusion Module completes the fusion of cross-domain features during feature decoding. Experiments show that the proposed network achieves good results on the J-Mean, F-Mean, and MAE metrics on public datasets.
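To make the "motion->appearance" and "appearance->motion" interaction concrete, the following is a minimal sketch of generic bidirectional cross-attention between two token sets. It is not the paper's Bidirectional Cross Attention Module, whose internals the abstract does not specify. The function names, the absence of learned query/key/value projections, the scaled dot-product scoring, and the residual addition are all assumptions made for illustration.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(query_feats, context_feats):
    """Hypothetical helper: each query token gathers information from the
    context tokens via scaled dot-product attention, then adds it back to
    itself (residual). Both inputs are lists of equal-length vectors.
    Assumption: no learned projections, unlike a trained attention layer."""
    dim = len(query_feats[0])
    scale = 1.0 / math.sqrt(dim)
    enhanced = []
    for q in query_feats:
        # Similarity between this query token and every context token.
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in context_feats]
        weights = softmax(scores)
        # Weighted sum of context tokens, one output value per channel.
        attended = [
            sum(w * k[j] for w, k in zip(weights, context_feats))
            for j in range(dim)
        ]
        enhanced.append([qi + ai for qi, ai in zip(q, attended)])
    return enhanced


def bidirectional_cross_attention(appearance, motion):
    """Run attention in both directions (illustrative only)."""
    # "motion->appearance": appearance tokens query the motion tokens.
    appearance_out = cross_attend(appearance, motion)
    # "appearance->motion": motion tokens query the appearance tokens.
    motion_out = cross_attend(motion, appearance)
    return appearance_out, motion_out


if __name__ == "__main__":
    appearance = [[1.0, 0.0], [0.0, 1.0]]  # toy appearance features
    motion = [[0.5, 0.5], [0.5, 0.5]]      # toy optical-flow features
    app_out, mot_out = bidirectional_cross_attention(appearance, motion)
    print(app_out)
    print(mot_out)
```

In this sketch each stream keeps its own token count and channel width, so the two enhanced feature sets could be passed on to a separate fusion step in a decoder; how BCSANet actually performs that fusion is described only at a high level in the abstract.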