AID-Purifier: A Light Auxiliary Network for Boosting Adversarial Defense

Duhun Hwang,Eunjung Lee,Wonjong Rhee
DOI: https://doi.org/10.48550/arXiv.2107.06456
2021-07-14
Abstract:We propose an AID-purifier that can boost the robustness of adversarially-trained networks by purifying their inputs. AID-purifier is an auxiliary network that works as an add-on to an already trained main classifier. To keep it computationally light, it is trained as a discriminator with a binary cross-entropy loss. To obtain additionally useful information from the adversarial examples, the architecture design is closely related to information maximization principles where two layers of the main classification network are piped to the auxiliary network. To assist the iterative optimization procedure of purification, the auxiliary network is trained with AVmixup. AID-purifier can be used together with other purifiers such as PixelDefend for an extra enhancement. The overall results indicate that the best performing adversarially-trained networks can be enhanced by the best performing purification networks, where AID-purifier is a competitive candidate that is light and robust.
Machine Learning,Artificial Intelligence,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the defense ability against adversarial examples in deep neural networks. Specifically, the author proposes a lightweight auxiliary network named AID - Purifier, aiming to enhance the robustness of the adversarially - trained network by purifying the input. As an add - on to the main classifier, AID - Purifier can improve the model's defense ability against adversarial attacks without significantly increasing the computational burden. The paper mainly focuses on the third type of adversarial defense method - adversarial purification, and attempts to develop a computationally lightweight and easily attachable purifier to improve the performance of the existing best adversarially - trained networks.