Towards Accurate Camouflaged Object Detection with Mixture Convolution and Interactive Fusion

Geng Chen, Xinrui Chen, Bo Dong, Mingchen Zhuge, Yongxiong Wang, Hongbo Bi, Jian Chen, Peng Wang, Yanning Zhang
2024-07-19
Abstract: Camouflaged object detection (COD), which aims to identify the objects that conceal themselves into the surroundings, has recently drawn increasing research efforts in the field of computer vision. In practice, the success of deep learning based COD is mainly determined by two key factors, including (i) A significantly large receptive field, which provides rich context information, and (ii) An effective fusion strategy, which aggregates the rich multi-level features for accurate COD. Motivated by these observations, in this paper, we propose a novel deep learning based COD approach, which integrates the large receptive field and effective feature fusion into a unified framework. Specifically, we first extract multi-level features from a backbone network. The resulting features are then fed to the proposed dual-branch mixture convolution modules, each of which utilizes multiple asymmetric convolutional layers and two dilated convolutional layers to extract rich context features from a large receptive field. Finally, we fuse the features using specially-designed multilevel interactive fusion modules, each of which employs an attention mechanism along with feature interaction for effective feature fusion. Our method detects camouflaged objects with an effective fusion strategy, which aggregates the rich context information from a large receptive field. All of these designs meet the requirements of COD well, allowing the accurate detection of camouflaged objects. Extensive experiments on widely-used benchmark datasets demonstrate that our method is capable of accurately detecting camouflaged objects and outperforms the state-of-the-art methods.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses key challenges in **Camouflaged Object Detection (COD)**. COD is harder than conventional object detection or segmentation because the boundary contrast between a camouflaged object and its surroundings is low. The paper identifies two limitations in existing deep-learning-based COD methods:

1. **The importance of a large receptive field is overlooked**: Context information is crucial for COD, and a larger receptive field supplies the rich context features needed to detect camouflaged objects accurately.
2. **The multi-scale feature fusion strategy is too simple**: Most methods fuse multi-scale features with simple operations such as concatenation and addition. These operations fail to capture the information most relevant to camouflaged object detection, which results in poor performance.

To address these limitations, the authors propose a new deep-learning-based COD model, MCIF-Net (Mixture Convolution and Interactive Fusion Network). The model improves detection accuracy through a specially designed dual-branch mixture convolution module (DMC) and a multi-level interactive fusion module (MIF). The specific contributions are as follows:

- Propose a new feature fusion module, MIF, which aggregates multi-scale features through a high-level interactive attention mechanism to achieve accurate detection of camouflaged objects.
- Design a receptive field module, DMC, which uses multiple types of convolution operations to enlarge the receptive field and provide rich context features, significantly improving performance.
- Conduct extensive experiments on multiple COD benchmark datasets. The results show that MCIF-Net outperforms existing state-of-the-art models, and ablation studies verify the effectiveness of the proposed MIF and DMC modules.
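
To make the receptive-field argument concrete, the following minimal sketch computes how dilation widens a convolution's receptive field and how a pair of asymmetric convolutions (1×k followed by k×1) compares in weight count with a full k×k kernel. It uses only standard convolution arithmetic. The function names, kernel sizes, dilation rates, and channel count are illustrative assumptions and are not taken from the paper's DMC configuration.

```python
def effective_kernel(k: int, dilation: int = 1) -> int:
    """Span (in input pixels) covered by a k-tap kernel with the given dilation."""
    return k + (k - 1) * (dilation - 1)


def receptive_field(layers) -> int:
    """Receptive field along one axis of a stack of convolution layers.

    `layers` is a sequence of (kernel_size, dilation, stride) tuples,
    listed from the first layer to the last.
    """
    rf, jump = 1, 1  # jump = distance between adjacent outputs, in input pixels
    for k, d, s in layers:
        rf += (effective_kernel(k, d) - 1) * jump
        jump *= s
    return rf


def conv_weights(kh: int, kw: int, channels: int) -> int:
    """Weight count of a kh x kw convolution with equal in/out channels, no bias."""
    return kh * kw * channels * channels


if __name__ == "__main__":
    # Hypothetical settings chosen only for illustration.
    plain = [(3, 1, 1), (3, 1, 1)]    # two ordinary 3x3 layers
    dilated = [(3, 1, 1), (3, 3, 1)]  # second layer uses dilation 3
    print("plain stack RF:  ", receptive_field(plain))
    print("dilated stack RF:", receptive_field(dilated))

    c = 64  # assumed channel width
    asymmetric = conv_weights(1, 7, c) + conv_weights(7, 1, c)
    full = conv_weights(7, 7, c)
    print("1x7 + 7x1 weights:", asymmetric)
    print("7x7 weights:      ", full)
```

Under these assumed settings, the dilated stack covers a wider input span than the plain stack with the same number of weights, and the asymmetric pair spans a 7×7 window with far fewer weights than a full 7×7 kernel. This is the general motivation for combining both kinds of convolution when a large receptive field is needed; the actual layer arrangement in DMC is specified in the paper itself.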