BadRes: Reveal the Backdoors Through Residual Connection

Mingrui He,Tianyu Chen,Haoyi Zhou,Shanghang Zhang,Jianxin Li
DOI: https://doi.org/10.1109/icassp49357.2023.10094691
2022-01-01
Abstract:Generally, residual connections are indispensable network components in building Convolutional Neural Networks(CNNs) and Transformers for various downstream tasks in Computer Vision(CV), which encourages skip/short cuts between network blocks. However, the layer-by-layer loopback residual connections may also hurt the model’s robustness by allowing unsuspecting input. In this paper, we proposed a simple yet strong backdoor attack method called BadRes, where the residual connections play as a turnstile to be deterministic on clean inputs while unpredictable on poisoned ones. We have performed empirical evaluations on four datasets with ViT and BEiT models, and the BadRes achieves 97% attack success rate without any performance degradation on clean data. Moreover, we analyze BadRes with state-of-the-art defense methods and reveal the fundamental weakness lying in residual connections.
What problem does this paper attempt to address?