Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection

Liang Yao,Fan Liu,Chuanyi Zhang,Zhiquan Ou,Ting Wu
2024-08-21
Abstract:Knowledge distillation (KD) is an effective method for compressing models in object detection tasks. Due to limited computational capability, UAV-based object detection (UAV-OD) widely adopt the KD technique to obtain lightweight detectors. Existing methods often overlook the significant differences in feature space caused by the large gap in scale between the teacher and student models. This limitation hampers the efficiency of knowledge transfer during the distillation process. Furthermore, the complex backgrounds in UAV images make it challenging for the student model to efficiently learn the object features. In this paper, we propose a novel knowledge distillation framework for UAV-OD. Specifically, a progressive distillation approach is designed to alleviate the feature gap between teacher and student models. Then a new feature alignment method is provided to extract object-related features for enhancing student model's knowledge reception efficiency. Finally, extensive experiments are conducted to validate the effectiveness of our proposed approach. The results demonstrate that our proposed method achieves state-of-the-art (SoTA) performance in two UAV-OD datasets.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in the object - detection tasks based on unmanned aerial vehicles (UAVs), how to effectively use the knowledge distillation technique to improve the accuracy of lightweight student models, while overcoming the low efficiency of knowledge transfer caused by the scale difference between teacher models and student models and the domain - change problems brought by complex backgrounds. Specifically, the paper proposes a new knowledge distillation framework, aiming to narrow the gap in the feature space between teacher and student models by introducing a progressive distillation method and a new feature - alignment method, thereby improving the performance of lightweight student models in UAV object detection. In addition, this framework also uses the fast Fourier transform (FFT) to extract object - related features to enhance the knowledge - receiving efficiency of student models and address the challenges brought by domain changes.