Abstract: Camouflaged targets are widespread in the natural world. They blend into or closely resemble their surroundings, which makes them difficult for the human eye to identify accurately. In camouflaged target segmentation, the high similarity between foreground and background often leads to segmentation errors, imprecise edge detection, and missed small targets. To address these issues, this paper presents a robust localization-guided dual-branch network for the recognition of camouflaged targets. The network incorporates two key branches: a localization branch and an overall refinement branch. The localization branch achieves accurate preliminary localization of camouflaged targets through a robust localization module, which integrates different high-level feature maps in a partially decoded manner. The overall refinement branch then improves segmentation accuracy, starting from the output predictions of the localization branch. Within this branch, an edge refinement module is devised to reduce false-negative and false-positive interference. By conducting context exploration on each feature layer from top to bottom, this module further improves the precision of target edge segmentation. In addition, the network produces five jointly trained output prediction maps and introduces attention-guided heads for the different prediction maps in the overall refinement branch. These heads adjust the spatial positions and channel weights of each prediction map, so that every output is generated according to its own emphasis, which further strengthens the perception and feature representation capabilities of the model. To help the network generate highly confident and accurate candidate prediction regions, tailored loss functions are designed to match the objectives of the different prediction maps.
We evaluated our method on three publicly available camouflaged object detection datasets and compared it with state-of-the-art network models. On COD10K, the largest dataset, our method achieved a Structure-measure of 0.827 and also outperformed recent network models on the other evaluation metrics.
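
The joint training of five output prediction maps described above can be illustrated with a minimal sketch. The abstract does not specify the tailored loss functions, so this example assumes a common stand-in, binary cross-entropy plus a soft IoU term, applied to each map and summed with per-map weights. The function names (`bce_loss`, `soft_iou_loss`, `joint_loss`), the choice of loss terms, and the default equal weights are all hypothetical and are not taken from the paper.

```python
import math


def bce_loss(pred, target, eps=1e-7):
    """Mean binary cross-entropy between two equally sized 2-D maps."""
    total, count = 0.0, 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
            total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
            count += 1
    return total / count


def soft_iou_loss(pred, target, eps=1e-7):
    """1 - soft IoU, computed from probabilities instead of hard masks."""
    inter, union = 0.0, 0.0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += p * t
            union += p + t - p * t
    return 1.0 - (inter + eps) / (union + eps)


def joint_loss(pred_maps, target, weights=None):
    """Weighted sum of per-map losses over all output prediction maps.

    Assumption: every map is supervised by the same ground-truth mask and
    the same BCE + IoU combination; the paper tailors the loss per map.
    """
    if weights is None:
        weights = [1.0] * len(pred_maps)  # assumed equal weighting
    if len(weights) != len(pred_maps):
        raise ValueError("need exactly one weight per prediction map")
    return sum(
        w * (bce_loss(p, target) + soft_iou_loss(p, target))
        for w, p in zip(weights, pred_maps)
    )


# Toy 2x2 ground truth and five identical, maximally uncertain predictions.
target = [[1.0, 0.0], [0.0, 1.0]]
uncertain = [[0.5, 0.5], [0.5, 0.5]]
loss_uncertain = joint_loss([uncertain] * 5, target)
loss_perfect = joint_loss([target] * 5, target)
```

In this toy case the perfect predictions give a joint loss close to zero, while the uncertain ones are penalized on every map. In a real network each of the five maps would come from a different head, and the weights or loss terms could differ per map to reflect what that output is meant to emphasize.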

Robust Localization-Guided Dual-Branch Network for Camouflaged Object Segmentation