Abstract:Object detection based on Knowledge Distillation can enhance the capabilities and performance of 5G and 6G networks in various domains, such as autonomous vehicles, smart surveillance, and augmented reality. The integration of object detection with Knowledge Distillation techniques is expected to play a pivotal role in realizing the full potential of these networks. This study presents Shared Knowledge Distillation (Shared-KD) as a solution to overcome optimization challenges caused by disparities in cross-layer features between teacher–student networks. The significant gaps in intermediate-level features between teachers and students present a considerable obstacle to the efficacy of distillation. To tackle this issue, we draw inspiration from collaborative learning in real-world education, where teachers work together to prepare lessons and students engage in peer learning. Building upon this concept, our innovative contributions in model construction are highlighted as follows: (1) A teacher knowledge augmentation module: this module is proposed to combine lower-level teacher features, facilitating the knowledge transfer from the teacher to the student. (2) A student mutual learning module is introduced to enable students to learn from each other, mimicking the peer learning concept in collaborative learning. (3) The Teacher Share Module combines lower-level teacher features: the specific functionality of the teacher knowledge augmentation module is described, which involves combining lower-level teacher features. (4) The multi-step transfer process can be easily optimized due to the minimal gap between the features: the proposed approach breaks down the knowledge transfer process into multiple steps, which can be easily optimized due to the minimal gap between the features involved in each step. Shared-KD uses simple feature losses without additional weights in transformation, resulting in an efficient distillation process that can be easily combined with other methods for further improvement. The effectiveness of our approach is validated through experiments on popular tasks such as object detection and instance segmentation.

Knowledge Distillation Meets Open-Set Semi-supervised Learning

Knowledge Distillation Meets Open-Set Semi-Supervised Learning

Knowledge Distillation for Road Detection based on cross-model Semi-Supervised Learning

Knowledge Distillation with Deep Supervision

Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data

Multi-target Knowledge Distillation Via Student Self-reflection

Self-Referenced Deep Learning

Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning

Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks

Small Scale Data-Free Knowledge Distillation

Spherical Knowledge Distillation.

Self-Knowledge Distillation via Progressive Associative Learning

SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector

Knowledge Distillation Meets Self-Supervision

Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation

Sampling to Distill: Knowledge Transfer from Open-World Data

Distilling Object Detectors with Global Knowledge

A Unified Asymmetric Knowledge Distillation Framework for Image Classification

What Knowledge Gets Distilled in Knowledge Distillation?

Shared Knowledge Distillation Network for Object Detection

An Embarrassingly Simple Approach for Knowledge Distillation