Abstract: Camouflaged object detection (COD), which aims to identify the objects that conceal themselves into the surroundings, has recently drawn increasing research efforts in the field of computer vision. In practice, the success of deep learning based COD is mainly determined by two key factors, including (i) a significantly large receptive field, which provides rich context information, and (ii) an effective fusion strategy, which aggregates the rich multi-level features for accurate COD. Motivated by these observations, in this paper, we propose a novel deep learning based COD approach, which integrates the large receptive field and effective feature fusion into a unified framework. Specifically, we first extract multi-level features from a backbone network. The resulting features are then fed to the proposed dual-branch mixture convolution modules, each of which utilizes multiple asymmetric convolutional layers and two dilated convolutional layers to extract rich context features from a large receptive field. Finally, we fuse the features using specially-designed multilevel interactive fusion modules, each of which employs an attention mechanism along with feature interaction for effective feature fusion. Our method detects camouflaged objects with an effective fusion strategy, which aggregates the rich context information from a large receptive field. All of these designs meet the requirements of COD well, allowing the accurate detection of camouflaged objects. Extensive experiments on widely-used benchmark datasets demonstrate that our method is capable of accurately detecting camouflaged objects and outperforms the state-of-the-art methods.
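The abstract attributes the large receptive field to asymmetric and dilated convolutions. The arithmetic behind that claim can be sketched with the standard receptive-field formula for stacked convolutions. The layer configurations below are hypothetical illustrations chosen for this example; they are not the kernel sizes or dilation rates used in the paper.

```python
def receptive_field(layers):
    """Receptive field along one axis of a stack of convolution layers.

    Each layer is a (kernel_size, dilation, stride) tuple for that axis.
    """
    rf, jump = 1, 1
    for kernel, dilation, stride in layers:
        # A dilated kernel spans (kernel - 1) * dilation + 1 input positions.
        rf += (kernel - 1) * dilation * jump
        jump *= stride
    return rf


def weights_per_channel_pair(kernel, asymmetric):
    """Weights needed for one input/output channel pair (bias ignored)."""
    # A 1xk layer followed by a kx1 layer uses 2k weights instead of k*k.
    return 2 * kernel if asymmetric else kernel * kernel


# Hypothetical configurations (assumptions, not taken from the paper).
plain_stack = [(3, 1, 1), (3, 1, 1)]               # two ordinary 3x3 layers
dilated_stack = [(3, 1, 1), (3, 3, 1), (3, 5, 1)]  # dilation rates 1, 3, 5
asym_width = [(7, 1, 1), (1, 1, 1)]                # 1x7 then 7x1, width axis

print(receptive_field(plain_stack))         # 5
print(receptive_field(dilated_stack))       # 19
print(receptive_field(asym_width))          # 7
print(weights_per_channel_pair(7, False))   # 49
print(weights_per_channel_pair(7, True))    # 14
```

Under these assumed settings, dilation widens the receptive field without adding weights, and an asymmetric 1x7 plus 7x1 pair covers the same 7x7 extent as a full 7x7 kernel with fewer weights.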

What problem does this paper attempt to address?

This paper addresses key challenges in **Camouflaged Object Detection (COD)**. Compared with traditional object detection or segmentation tasks, COD is considerably harder, mainly because the boundary contrast between camouflaged objects and their surroundings is low. The paper identifies two limitations in existing deep-learning-based COD methods:

1. **The importance of a larger receptive field is ignored**: Context information is crucial for COD, and a larger receptive field provides the rich context features needed to detect camouflaged objects accurately.
2. **The multi-scale feature fusion strategy is too simple**: Most methods fuse multi-scale features with very simple operations (such as concatenation and addition). These operations cannot capture the information most relevant to camouflaged object detection, resulting in poor performance.

To solve these problems, the authors propose a new deep-learning-based COD model, MCIF-Net (Mixture Convolution and Interactive Fusion Network). The model improves detection accuracy through a specially designed dual-branch mixture convolution module (DMC) and a multi-level interactive fusion module (MIF). The specific contributions are as follows:

- Propose a new feature fusion module, MIF, which aggregates multi-scale features through a high-level interactive attention mechanism to detect camouflaged objects accurately.
- Design an effective receptive field module, DMC, which uses multiple types of convolution operations to expand the receptive field, provide rich context features, and significantly improve performance.
- Conduct extensive experiments on multiple COD benchmark datasets.

The experimental results show that MCIF-Net outperforms existing state-of-the-art models, and ablation studies verify the effectiveness of the proposed MIF and DMC modules.
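The contrast drawn above between simple fusion (addition, concatenation) and attention-based fusion can be illustrated with a toy sketch. The gating scheme below is a generic channel-attention gate assumed for illustration only; it is not the MIF module from the paper, and all function names are hypothetical. Feature maps are represented as plain lists of channels so the example runs without any external library.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def add_fusion(a, b):
    """Element-wise addition of two feature maps with matching shapes."""
    return [[x + y for x, y in zip(ca, cb)] for ca, cb in zip(a, b)]


def concat_fusion(a, b):
    """Channel-wise concatenation: the channel lists are simply joined."""
    return a + b


def gated_fusion(a, b):
    """Generic channel-attention gate (an assumption, not the paper's MIF).

    A per-channel weight is computed from the globally pooled sum of both
    inputs, then used to blend the two feature maps channel by channel.
    """
    summed = add_fusion(a, b)
    pooled = [sum(ch) / len(ch) for ch in summed]  # global average pooling
    gates = [sigmoid(p) for p in pooled]           # one weight per channel
    return [
        [g * x + (1.0 - g) * y for x, y in zip(ca, cb)]
        for g, ca, cb in zip(gates, a, b)
    ]


# Two toy feature maps: 2 channels, 2 spatial positions each.
low_level = [[1.0, 1.0], [0.0, 0.0]]
high_level = [[-1.0, -1.0], [2.0, 2.0]]

print(add_fusion(low_level, high_level))          # [[0.0, 0.0], [2.0, 2.0]]
print(len(concat_fusion(low_level, high_level)))  # 4
print(gated_fusion(low_level, high_level))
```

Addition and concatenation treat every channel identically, while the gate weights each channel according to the inputs themselves. That input-dependent weighting is the general idea behind attention-based fusion; the paper's module additionally involves feature interaction across levels, which this sketch does not attempt to reproduce.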

Towards Accurate Camouflaged Object Detection with Mixture Convolution and Interactive Fusion

Edge-Guided Camouflaged Object Detection Via Multi-Level Feature Integration

Attention guided multi-level feature aggregation network for camouflaged object detection

Camouflaged Object Detection via Context-Aware Cross-Level Fusion

Camouflaged Object Detection with Feature Decomposition and Edge Reconstruction

Depth Awakens: A Depth-perceptual Attention Fusion Network for RGB-D Camouflaged Object Detection

MSCAF-Net: A General Framework for Camouflaged Object Detection via Learning Multi-Scale Context-Aware Features

Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects

Exploration, fusion, and refinement: a multivariate features interaction network for visual camouflaged detection

A Survey of Camouflaged Object Detection and Beyond

GLCONet: Learning Multi-source Perception Representation for Camouflaged Object Detection

Camouflaged Object Detection with Feature Grafting and Distractor Aware

Camouflaged Object Detection via Dual-branch Fusion and Dual Self-similarity constraints

Towards Deeper Understanding of Camouflaged Object Detection

Exploring Depth Contribution for Camouflaged Object Detection

Detecting Camouflaged Object in Frequency Domain

Camouflaged object detection with counterfactual intervention

Detecting Camouflaged Objects via Multi-Stage Coarse-to-Fine Refinement

Dual Guidance Enhancing Camouflaged Object Detection Via Focusing Boundary and Localization Representation