Improving Adversarial Robustness Through a Curriculum-Guided Reliable Distillation

Jiawen Li,Kun Fang,Xiaolin Huang,Jie Yang
DOI: https://doi.org/10.1016/j.cose.2023.103411
IF: 5.105
2023-01-01
Computers & Security
Abstract:Adversarial distillation (AD) which combines adversarial training with knowledge distillation has become a powerf u l procedu r e to mitigate the effects of adversarial examples to deep neural networks, which aims at distilling a robust student network from a pre-trained robust teacher network. In AD, the teacher's reliabilit y has been a crucial problem and recent work incorporates a self-distillation loss on the student into the AD framework, encouraging the student to partially trust the teacher and gradually trust itsel f more. However, the only key factor to control this trust level cannot be adaptive enough for different training samples, guiding the student to trust itsel f inappropriately and inspiring us to pursue a better w a y to obtain more reliable supervision. In this paper we revisit the performance variation of al l training samples from the teacher to student, showing that previous work is not always adaptive with the distillation and can be further refined . Accordingly, a more effective and justified supervision, namely Curricular Adversarial Distillation (CAD), is proposed to help boost the self-distillation process. In CAD, the KL divergences between both the clean and adversarial outputs of teacher and the smoothed labels are calculated for the supervision to address the teacher's unreliabilit y in the self-distillation. Besides, the smoothed labels follow a curriculum-style scheduler, getting smoothed to different degrees at different training stages and helping the teacher continuously be adaptive to the varying adversarial examples from student. Extensive experiments have shown the superiority of ou r strateg y in distilling a robust student network against various attacks.
What problem does this paper attempt to address?