MCDet: Multi-Content Collaboration Detector for Multiscale Remote Sensing Object

Wenping Ma,Hao Chen,Hao Zhu,Na Li,Yue Wu,Biao Hou,Licheng Jiao
DOI: https://doi.org/10.1109/lgrs.2024.3361508
IF: 5.343
2024-02-16
IEEE Geoscience and Remote Sensing Letters
Abstract:In previous works, powerful convolutional neural network (CNN) backbones are typically used for one- or two-stage detectors to facilitate multicategories object classification. Unfortunately, continuous convolution and pooling operations tend to weaken the detailed information. We propose an end-to-end multi-content collaboration detector (MCDet) to improve object recognition accuracy. First, we summarize the reasons for the disappearance of detailed features in the traditional feature extraction backbone networks and propose a shallow clue refinement (SCR) module, which helps us to retain more critical local detail information in the downsampling process. Second, to receive more suitable contextual information, we design a self-dilating spatial pooling (SSP) module; it adaptively learns a contextual reception field, thereby alleviating the mismatch between the theoretical receptive field of the network design and the practical requirements. Finally, extensive experiments on the NWPU VHR-10 and DIOR datasets have shown that the proposed MCDet significantly improves detection accuracy. Our code is available at https://github.com/Xidian-AIGroup190726/RS-objectdetection-MCDet.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?