Abstract:Recently, remarkable object detection and facial expression recognition (FER) approaches have been made by researchers. However, all of these models are trained and tested on high-resolution images without considering that low-resolution images are more common in practical application. Therefore, to relieve this issue, in this paper we aim to propose a knowledge distillation approach to transfer the learned high-resolution features from a teacher network to a simpler structured student network trained on low-resolution inputs. In our approach, instead of transferring knowledge from the same layers of the teacher and student network, we chose multi-level knowledge of the teacher network to supervise single-level output of the student network. Furthermore, we do not directly use the knowledge from the teacher network, instead, before knowledge transfer we concatenate different level features of the teacher network and figure our what kind of information is important and what is redundant and set different weight value to them. Then we use these knowledge with different weights to guide the output of single layer student network to extract abundant features from the low-resolution images. To evaluate the effectiveness of our proposed approach, we apply this approach to two models for object detection and facial expression recognition tasks. Through our experiments, we find that, in object detection task, the CornerNet achieves an accuracy of 40.6% on the original MC COCO dataset, while this index drops dramatically to only 34.2% on the resolution degraded images. By comparison, our proposed model trained by our knowledge distillation approach achieves 35.4% and 33.4% on the original and resolution degraded datasets, respectively. At the same time, compared to CornerNet the number of layers of the proposed network has been reduced by about 60%. Furthermore, in the task of facial expression recognition and image classification, the similar experimental results can also be observed.

Knowledge Distillation of Attention and Residual U-Net: Transfer from Deep to Shallow Models for Medical Image Classification.

Knowledge Distillation Method for Surface Defect Detection.

Research on Knowledge Distillation Algorithm of Object Detection

DCCD: Reducing Neural Network Redundancy Via Distillation

Attention-Fused CNN Model Compression with Knowledge Distillation for Brain Tumor Segmentation

Efficient knowledge distillation for liver CT segmentation using growing assistant network

A Medical Image Segmentation Method Combining Knowledge Distillation and Contrastive Learning

RSKD: Enhanced medical image segmentation via multi-layer, rank-sensitive knowledge distillation in Vision Transformer models

Implantation of a synthetic cornea: design, development and biological response.

Multi-level knowledge distillation for low-resolution object detection and facial expression recognition

ResKD: Residual-Guided Knowledge Distillation

Efficient Medical Image Segmentation Based on Knowledge Distillation

Evaluating Knowledge Transfer in Neural Network for Medical Images

Class Attention Transfer Based Knowledge Distillation

MSKD: Structured knowledge distillation for efficient medical image segmentation

Simplified Knowledge Distillation for Deep Neural Networks Bridging the Performance Gap with a Novel Teacher–Student Architecture

Knowledge distillation based on multi-layer fusion features

Hierarchical Multi-Attention Transfer for Knowledge Distillation

Multi-Task Multi-Scale Contrastive Knowledge Distillation for Efficient Medical Image Segmentation

Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image Classification

Knowledge Distillation Based on Narrow-Deep Networks