Light field saliency object detection based on self-selected multimodal fusion

Dan Xu,Hongjie Wu
DOI: https://doi.org/10.1117/12.3005914
2023-10-10
Abstract:Saliency detection of light field images is a key technology in applications such as visual tracking, target detection and image compression. Existing light field saliency target detection tends to ignore the complementarity of cross-mode light field data, inevitably introducing redundant information and leading to blurred salient images. Even in similar or confusing scenes, there are problems such as incomplete detection objects and difficult background suppression. To this end, this paper proposes an image saliency detection network based on self-selective cross-modal feature fusion. Firstly, hierarchical features are extracted from the backbone network, and each modal feature is optimized based on the attention mechanism using the spatial alignment and channel rescaling modules, and then the two modal features are fused to obtain a more accurate saliency map guided by edge information. Experimental results on the latest light field dataset show that this method outperforms the comparison method both quantitatively and qualitatively.
Computer Science,Engineering
What problem does this paper attempt to address?