Abstract: The susceptibility of deep neural networks (DNNs) to adversarial attacks undermines their reliability in many applications, which motivates both a closer study of these vulnerabilities and the development of robust defenses. The DeepFool algorithm by Moosavi-Dezfooli et al. (2016) was a pivotal step toward finding the minimal perturbation needed to make a model misclassify an input image. However, DeepFool is untargeted: it cannot steer the misclassification toward a chosen class. In addition, previous studies have concentrated on attack success rate without adequately addressing how much the image is distorted, whether image quality is preserved, or how confident the model is in the misclassification. To bridge these gaps, we introduce the Enhanced Targeted DeepFool (ET DeepFool) algorithm, an extension of DeepFool that lets the attacker specify the target class and set a configurable minimum confidence score for the misclassification. Our experiments show that this approach preserves image integrity better and requires smaller perturbations across a variety of DNN architectures. Unlike earlier variants, such as the Targeted DeepFool by Gajjar et al. (2022), our method gives finer control over the perturbation process through the confidence threshold, enabling more precise manipulation of model outputs. Preliminary results show that some models, including AlexNet and the Vision Transformer, are notably robust to such manipulations. These differences in robustness, exposed by varying the confidence threshold, could have significant implications for the field of image recognition. Our code will be made public upon acceptance of the paper.
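To make the idea of a targeted attack with a minimum confidence score concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy linear softmax classifier, for which the gradient of each class score is simply the corresponding weight row. The function names (`logits`, `softmax`, `targeted_attack`), the parameters (`min_confidence`, `overshoot`, `max_iter`), and the choice of stepping against the strongest competing class are all illustrative assumptions.

```python
import math

def logits(W, b, x):
    """Scores of a linear classifier: one dot product per class."""
    return [sum(w * v for w, v in zip(row, x)) + bias for row, bias in zip(W, b)]

def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def targeted_attack(W, b, x, target, min_confidence=0.9, overshoot=0.02, max_iter=50):
    """Push x toward `target` until it is predicted with at least `min_confidence`.

    Assumption: the model is linear, so the gradient of class score c is W[c].
    Assumption: W[target] differs from every other row (non-zero direction).
    Returns (perturbed input, predicted class, softmax confidence of that class).
    """
    x_adv = list(x)
    classes = range(len(W))
    for _ in range(max_iter):
        z = logits(W, b, x_adv)
        p = softmax(z)
        pred = max(classes, key=lambda c: z[c])
        # Stop only when BOTH conditions hold: right class, enough confidence.
        if pred == target and p[target] >= min_confidence:
            break
        # Strongest competing class (illustrative choice, not from the paper).
        k = max((c for c in classes if c != target), key=lambda c: z[c])
        w = [wt - wk for wt, wk in zip(W[target], W[k])]  # direction toward target
        gap = z[target] - z[k]                            # negative while k still wins
        norm_sq = sum(v * v for v in w)
        # DeepFool-style step: distance to the linearized boundary, plus overshoot.
        scale = (1.0 + overshoot) * (abs(gap) + 1e-4) / norm_sq
        x_adv = [xi + scale * wi for xi, wi in zip(x_adv, w)]
    z = logits(W, b, x_adv)
    p = softmax(z)
    pred = max(classes, key=lambda c: z[c])
    return x_adv, pred, p[pred]

# Toy example: three classes, two input features (all values are made up).
W = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
b = [0.0, 0.0, 0.0]
x = [2.0, 0.5]  # initially classified as class 0
x_adv, pred, conf = targeted_attack(W, b, x, target=1, min_confidence=0.9)
```

In this sketch the loop keeps stepping after the target class is first reached, because crossing the decision boundary alone typically yields a confidence only slightly above that of the competing class. For a real DNN the weight rows would be replaced by gradients computed through the network at each iteration.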

Revisiting DeepFool: generalization and improvement

Attack As Defense: Characterizing Adversarial Examples Using Robustness

DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks

Tailoring Adversarial Attacks on Deep Neural Networks for Targeted Class Manipulation Using DeepFool Algorithm

FoolChecker: A platform to evaluate the robustness of images against adversarial attacks

Attacking Adversarial Attacks as A Defense

Towards Evaluating the Robustness of Neural Networks

Robust Machine Learning Against Adversarial Samples at Test Time

DeepDefense: Training Deep Neural Networks with Improved Robustness

AdvFoolGen: Creating Persistent Troubles for Deep Classifiers

Fooling the Textual Fooler via Randomizing Latent Representations

Towards Deep Learning Models Resistant to Adversarial Attacks

Deep Defense: Training DNNs with Improved Adversarial Robustness

Adversarial robustness improvement for deep neural networks

A Geometry-Inspired Decision-Based Attack

Towards A Critical Evaluation of Robustness for Deep Learning Backdoor Countermeasures

Detecting and Mitigating Adversarial Perturbations for Robust Face Recognition

Adversarial Attack? Don't Panic

Opportunities and Challenges in Deep Learning Adversarial Robustness: A Survey

Improving the Reliability of Deep Neural Networks in NLP: A Review