Metamorphic Detection of Adversarial Examples in Deep Learning Models With Affine Transformations

Rohan Reddy Mekala, Gudjon Einar Magnusson, Adam Porter, Mikael Lindvall, Madeline Diep
DOI: https://doi.org/10.1109/MET.2019.00016
2019-07-10
Abstract: Adversarial attacks are small, carefully crafted perturbations, imperceptible to the naked eye, that, when added to an image, cause deep learning models to misclassify it, with potentially detrimental outcomes. With the rise of artificial intelligence models in safety- and security-intensive consumer industries such as self-driving cars, camera surveillance, and face recognition, there is a growing need to guard against adversarial attacks. In this paper, we present an approach that uses metamorphic testing principles to automatically detect such adversarial attacks. The approach can detect image manipulations so small that a human cannot detect them through visual inspection. By applying metamorphic relations based on distance-ratio-preserving affine image transformations, which compare the behavior of the original and transformed images, we show that our proposed approach can determine whether or not the input image is adversarial with a high degree of accuracy.
Computer Vision and Pattern Recognition, Machine Learning, Image and Video Processing
What problem does this paper attempt to address?
This paper addresses the detection of adversarial examples in deep learning models. An adversarial example is an image to which small, carefully designed perturbations have been added. The perturbations are nearly imperceptible to the human eye, yet they cause a deep learning model to misclassify the image, which may have harmful consequences. As artificial intelligence models are increasingly deployed in safety- and security-intensive consumer industries such as self-driving cars, camera surveillance, and face recognition, protection against adversarial attacks has become particularly important. The paper proposes a method based on metamorphic testing principles to detect such attacks automatically. The method can detect image manipulations too small for a human to notice through visual inspection. It applies metamorphic relations based on distance-ratio-preserving affine image transformations, compares the model's behavior on the original image with its behavior on the transformed image, and uses the result to decide, with a high degree of accuracy, whether the input image is adversarial.
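
The idea of comparing a model's behavior on an image and on its transformed copies can be illustrated with a minimal Python sketch. It is not the authors' implementation: the text above does not specify the transformations, the decision rule, or the threshold, so the names `TRANSFORMS`, `agreement`, `looks_adversarial`, and `min_agreement`, and the deliberately brittle `toy_predict` stand-in model, are all assumptions made for illustration. The sketch uses one-pixel translations (with wrap-around, a simplification) and a horizontal flip, which preserve distance ratios between points, and flags an input when the predicted label fails to survive most of them.

```python
import numpy as np


def shift(image, dx, dy):
    """Translate by (dx, dy) pixels; wrap-around is a simplification."""
    return np.roll(image, shift=(dy, dx), axis=(0, 1))


# Hypothetical choice of distance-ratio-preserving transformations.
TRANSFORMS = [
    lambda im: shift(im, 1, 0),
    lambda im: shift(im, 0, 1),
    lambda im: shift(im, -1, -1),
    np.fliplr,
]


def agreement(predict, image, transforms=TRANSFORMS):
    """Fraction of transformed copies that keep the original label."""
    base = predict(image)
    same = sum(predict(t(image)) == base for t in transforms)
    return same / len(transforms)


def looks_adversarial(predict, image, min_agreement=0.5, transforms=TRANSFORMS):
    """Flag the input when the label is unstable under the transformations.

    min_agreement is an assumed threshold, not a value from the paper.
    """
    return agreement(predict, image, transforms) < min_agreement


def toy_predict(image):
    """Toy stand-in for a classifier (assumption, for illustration only).

    A bright top-left pixel overrides the brightness rule, which mimics a
    model that a tiny, position-specific perturbation can fool.
    """
    if image[0, 0] > 0.9:
        return 1
    return int(image.mean() > 0.5)


clean = np.full((8, 8), 0.2)
attacked = clean.copy()
attacked[0, 0] = 1.0  # single-pixel perturbation that flips the toy label

print(looks_adversarial(toy_predict, clean))
print(looks_adversarial(toy_predict, attacked))
```

In this toy setting the clean image keeps its label under every transformation, while the perturbed image loses its label as soon as the altered pixel moves, so only the perturbed image is flagged. A real detector would presumably replace `toy_predict` with the model under test and tune the transformations and threshold on labeled data.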