AFLF: a defensive framework to defeat multi-faceted adversarial attacks via attention feature fusion

Lovi Dhamija,Urvashi Bansalb
DOI: https://doi.org/10.1007/s12530-024-09643-z
IF: 2.347
2024-12-14
Evolving Systems
Abstract:Adversarial attacks threaten the reliability and security of Deep Neural Networks (DNNs), necessitating the need to develop robust defensive mechanisms beyond traditional defensive approaches. The perceptible and imperceptible input perturbations, when fed to Deep Learning (DL) models, could lead to a distortion of the feature space and can change the predictions. The proposed study designed a novel defensive strategy to enhance the robustness of classification and detection models against a wide range of adversarial attacks. The study introduced a two-stage defensive framework by using a unique integration of extracting robust feature representations, attention feature fusion, and model agnostic adversarial learning of features. The attention mechanism facilitates the effective combination of extracted features from different dimensions based on their relevancy to an adversarial nature using the Z-score method. It can lead to more efficient use of resources. After that, the model agnostic adversarial feature learning trains the model with original features and their adversarial counterparts to learn robust representations invariant to multi-faceted adversarial changes. The proposed work is then extensively evaluated against benchmark datasets of traffic signs and assessed for effectiveness against various adversarial attacking strategies. The improvements in robustness of the proposed defensive mechanism are demonstrated in terms of defense accuracy, fooling rate, and clean data accuracy loss. Ultimately, the proposed approach offers a practical and innovative defensive solution and contributes to advancing the field of adversarial defensive strategies by emphasizing a robust set of features without compromising the loss of clean data accuracy.
computer science, artificial intelligence
What problem does this paper attempt to address?