Robust Student Network Learning
Tianyu Guo,Chang Xu,Shiyi He,Boxin Shi,Chao Xu,Dacheng Tao
DOI: https://doi.org/10.1109/tnnls.2019.2929114
IF: 14.255
2019-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract:Deep neural networks bring in impressive accuracy in various applications, but the success often relies on heavy network architectures. Taking well-trained heavy networks as teachers, classical teacher–student learning paradigm aims to learn a student network that is lightweight yet accurate. In this way, a portable student network with significantly fewer parameters can achieve considerable accuracy, which is comparable to that of a teacher network. However, beyond accuracy, the robustness of the learned student network against perturbation is also essential for practical uses. Existing teacher-student learning frameworks mainly focus on accuracy and compression ratios, but ignore the robustness. In this paper, we make the student network produce more confident predictions with the help of the teacher network, and analyze the lower bound of the perturbation that will destroy the confidence of the student network. Two important objectives regarding prediction scores and gradients of examples are developed to maximize this lower bound, to enhance the robustness of the student network without sacrificing the performance. Experiments on benchmark data sets demonstrate the efficiency of the proposed approach to learning robust student networks that have satisfying accuracy and compact sizes.
computer science, artificial intelligence, theory & methods,engineering, electrical & electronic, hardware & architecture