Abstract:The existence of camouflage targets is widespread in the natural world, as they blend seamlessly or closely resemble their surrounding environment, making it difficult for the human eye to identify them accurately. In camouflage target segmentation, challenges often arise from the high similarity between the foreground and background, resulting in segmentation errors, imprecise edge detection, and overlooking of small targets. To address these issues, this paper presents a robust localization-guided dual-branch network for the recognition of camouflaged targets. Two crucial branches, i.e., a localization branch and an overall refinement branch are designed and incorporated. The localization branch achieves accurate preliminary localization of camouflaged targets by incorporating the robust localization module, which integrates different high-level feature maps in a partially decoded manner. The overall refinement branch optimizes segmentation accuracy based on the output predictions of the localization branch. Within this branch, the edge refinement module is devised to effectively reduce false negative and false positive interference. By conducting context exploration on each feature layer from top to bottom, this module further enhances the precision of target edge segmentation. Additionally, our network employs five jointly trained output prediction maps and introduces attention-guided heads for diverse prediction maps in the overall refinement branch. This design adjusts the spatial positions and channel weights of different prediction maps, generating output prediction maps based on the emphasis of each output, thereby further strengthening the perception and feature representation capabilities of the model. To improve its ability to generate highly confident and accurate prediction candidate regions, tailored loss functions are designed to cater to the objectives of different prediction maps. We conducted experiments on three publicly available datasets for camouflaged object detection to assess our methodology and compared it with state-of-the-art network models. On the largest dataset COD10K, our method achieved a Structure-measure of 0.827 and demonstrated superior performance in other evaluation metrics, outperforming recent network models.

Text-prompt Camouflaged Instance Segmentation with Graduated Camouflage Learning

Camouflaged Instance Segmentation Via Explicit De-Camouflaging

Camouflaged Instance Segmentation From Global Capture to Local Refinement

Multi-scale pooling learning for camouflaged instance segmentation

A Cross-Level Iterative Subtraction Network for Camouflaged Object Detection

A bioinspired three-stage model for camouflaged object detection

Robust Localization-Guided Dual-Branch Network for Camouflaged Object Segmentation

Detecting Camouflaged Objects via Multi-Stage Coarse-to-Fine Refinement

CamoFormer: Masked Separable Attention for Camouflaged Object Detection

Self-Supervised Learning for Camouflaged Object Detection

Detecting camouflaged objects via cross-level context supplement

Camouflaged Object Segmentation with Omni Perception

Mask-and-Edge Co-Guided Separable Network for Camouflaged Object Detection.

A Unified Query-based Paradigm for Camouflaged Instance Segmentation

Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way

Double-Branch Camouflaged Object Detection Method Based on Intra-Layer and Inter-Layer Information Integration

SegCLIP: Multimodal Visual-Language and Prompt Learning for High-Resolution Remote Sensing Semantic Segmentation

Camouflaged Object Detection via Context-Aware Cross-Level Fusion

Using High-Quality Feature for Weakly-Supervised Camouflaged Object Detection

COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection

SPANet: Spatial perceptual activation network for camouflaged object detection