Abstract:Object detection based on Knowledge Distillation can enhance the capabilities and performance of 5G and 6G networks in various domains, such as autonomous vehicles, smart surveillance, and augmented reality. The integration of object detection with Knowledge Distillation techniques is expected to play a pivotal role in realizing the full potential of these networks. This study presents Shared Knowledge Distillation (Shared-KD) as a solution to overcome optimization challenges caused by disparities in cross-layer features between teacher–student networks. The significant gaps in intermediate-level features between teachers and students present a considerable obstacle to the efficacy of distillation. To tackle this issue, we draw inspiration from collaborative learning in real-world education, where teachers work together to prepare lessons and students engage in peer learning. Building upon this concept, our innovative contributions in model construction are highlighted as follows: (1) A teacher knowledge augmentation module: this module is proposed to combine lower-level teacher features, facilitating the knowledge transfer from the teacher to the student. (2) A student mutual learning module is introduced to enable students to learn from each other, mimicking the peer learning concept in collaborative learning. (3) The Teacher Share Module combines lower-level teacher features: the specific functionality of the teacher knowledge augmentation module is described, which involves combining lower-level teacher features. (4) The multi-step transfer process can be easily optimized due to the minimal gap between the features: the proposed approach breaks down the knowledge transfer process into multiple steps, which can be easily optimized due to the minimal gap between the features involved in each step. Shared-KD uses simple feature losses without additional weights in transformation, resulting in an efficient distillation process that can be easily combined with other methods for further improvement. The effectiveness of our approach is validated through experiments on popular tasks such as object detection and instance segmentation.

Crowd Counting with Online Knowledge Learning

Efficient Crowd Counting Via Dual Knowledge Distillation.

Efficient Crowd Counting Via Structured Knowledge Transfer.

Reducing Capacity Gap in Knowledge Distillation with Review Mechanism for Crowd Counting

KD-Crowd: a knowledge distillation framework for learning from crowds

Online Knowledge Distillation via Collaborative Learning

Striking a Balance: Unsupervised Cross-Domain Crowd Counting via Knowledge Diffusion

Efficient Crowd Density Estimation with Edge Intelligence Via Structural Reparameterization and Knowledge Transfer

Semi-Online Knowledge Distillation

Recurrent Distillation based Crowd Counting

STKD: Distilling Knowledge From Synchronous Teaching for Efficient Model Compression

Categories of Response-Based, Feature-Based, and Relation-Based Knowledge Distillation

Multiple-Stage Knowledge Distillation

Reciprocal Teacher-Student Learning Via Forward and Feedback Knowledge Distillation

A Real-Time Deep Network for Crowd Counting

Shared Knowledge Distillation Network for Object Detection

Density-Aware Curriculum Learning for Crowd Counting

Counting Crowds with Perspective Distortion Correction via Adaptive Learning

Deep Collective Knowledge Distillation

BD-KD: Balancing the Divergences for Online Knowledge Distillation

Learning Discriminative Features for Crowd Counting