Abstract: Object detection in remote sensing images identifies and extracts information about the Earth's surface, providing data support and a research basis for multiple fields. Knowledge distillation (KD) for remote sensing object detection transfers the knowledge of a large teacher model to a smaller student model, yielding a detector with few parameters and high accuracy. Mainstream methods improve student performance by directly imitating teacher features, ignoring that teacher features can instead guide the student feature maps to generate high-ranking features during knowledge transfer. In this article, an adaptive composite feature generation (ACFG) strategy is proposed to achieve end-to-end trainable KD for object detection in remote sensing images, in which adaptive feature mapping improves the robustness of feature points under composite masks. In particular, a composite mask generator (CMG) module is proposed to select student instance-related features and point background features. Furthermore, a global and local projection layer (GLPL) module is proposed to connect the local and global information of the masked feature map, adaptively recovering the full feature map from its partial feature points. Finally, the balanced decoupling loss (BDL) is improved to handle foreground and background losses separately, so that the two decoupled features better enable the student model to learn instance-related information. Note that the proposed ACFG can perform KD for both single-stage and two-stage object detectors. Experimental results with both anchor-based and anchor-free detectors on the DIOR and DOTA datasets demonstrate that the proposed ACFG achieves clearly better performance than several state-of-the-art (SOTA) KD algorithms.
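The idea of handling foreground and background losses separately can be illustrated with a minimal sketch. This is not the paper's BDL, CMG, or GLPL: the function name `decoupled_distill_loss`, the weights `alpha` and `beta`, the single-channel feature maps, and the use of mean squared error are all assumptions made for illustration. Each region is normalized by its own number of feature points, so a small foreground is not drowned out by a large background.

```python
from typing import List

Grid = List[List[float]]


def decoupled_distill_loss(
    teacher: Grid,
    student: Grid,
    fg_mask: List[List[int]],
    alpha: float = 1.0,  # hypothetical foreground weight
    beta: float = 0.5,   # hypothetical background weight
) -> float:
    """Feature-imitation loss split into foreground and background terms.

    teacher, student: single-channel H x W feature maps (assumed same shape).
    fg_mask: 1 where a feature point belongs to an object instance, else 0.
    """
    fg_sum = bg_sum = 0.0
    fg_n = bg_n = 0
    for t_row, s_row, m_row in zip(teacher, student, fg_mask):
        for t, s, m in zip(t_row, s_row, m_row):
            sq_err = (t - s) ** 2
            if m:
                fg_sum += sq_err
                fg_n += 1
            else:
                bg_sum += sq_err
                bg_n += 1
    # Average each region over its own point count; an empty region adds 0.
    fg_loss = fg_sum / fg_n if fg_n else 0.0
    bg_loss = bg_sum / bg_n if bg_n else 0.0
    return alpha * fg_loss + beta * bg_loss


if __name__ == "__main__":
    teacher = [[1.0, 0.0], [0.0, 0.0]]
    student = [[0.0, 0.0], [0.0, 1.0]]
    mask = [[1, 0], [0, 0]]
    print(decoupled_distill_loss(teacher, student, mask))
```

In a real detector the mask would typically come from ground-truth boxes or a learned mask generator, and the feature maps would be multi-channel tensors; both are omitted here to keep the sketch dependency-free.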

Global-local Feature Aggregation for Event-based Object Detection on EventKITTI

Event-based Object Detection with Lightweight Spatial Attention Mechanism

Lightweight Real-Time Object Detection via Enhanced Global Perception and Intra-Layer Interaction for Complex Traffic Scenarios

Graph-based Asynchronous Event Processing for Rapid Object Recognition

Adaptive Composite Feature Generation for Object Detection in Remote Sensing Images

Local Fast R-CNN Flow for Object-Centric Event Recognition in Complex Traffic Scenes

Global to Local: A Scale-Aware Network for Remote Sensing Object Detection

Enhancing Traffic Object Detection in Variable Illumination with RGB-Event Fusion

SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for Autonomous Driving

Spatial-Temporal Feature Aggregation Network for Video Object Detection

Adaptive Scale and Spatial Aggregation for Real-Time Object Detection

CTAFFNet: CNN–Transformer Adaptive Feature Fusion Object Detection Algorithm for Complex Traffic Scenarios

Towards real-time object detection in GigaPixel-level video

GHAFNet: Global-context hierarchical attention fusion method for traffic object detection

From Dense to Sparse: Low-Latency and Speed-Robust Event-Based Object Detection

Spatio-temporal Focus and Lightweight Memory Network for Continuous Object Detection with Event Camera

Global and Local Information Aggregation Network for Edge-Aware Salient Object Detection

Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation

Detecting Every Object from Events

Global Memory and Local Continuity for Video Object Detection