AFMPM: Adaptive Feature Map Pruning Method Based on Feature Distillation

Yufeng Guo,Weiwei Zhang,Junhuang Wang,Ming Ji,Chenghui Zhen,Zhengzheng Guo
DOI: https://doi.org/10.1007/s13042-023-01926-2
2023-01-01
International Journal of Machine Learning and Cybernetics
Abstract:Feature distillation is a technology that uses the middle layer feature map of the teacher network as knowledge to transfer to the students. The feature information not only reflects the image information but also covers the feature extraction ability of the teacher network. However, the existing feature distillation methods lack theoretical guidance for feature map evaluation and suffer from the mismatch of sizes between high-dimensional feature maps and low-dimensional feature maps, and poor information utilization. In this paper, we propose an Adaptive Feature Map Pruning Method (AFMPM) for feature distillation, which transforms the problem of feature map pruning into the problem of optimization so that the valid information of the feature map is retained to the maximum extent. AFMPM has achieved significant improvements in feature distillation, and the advanced and generalized nature of the method has been verified by conducting experiments on the teacher-student distillation framework and the self-distillation framework.
What problem does this paper attempt to address?