Black-box adversarial attacks by manipulating image attributes

Xingxing Wei,Ying Guo,Bo Li
DOI: https://doi.org/10.1016/j.ins.2020.10.028
IF: 8.1
2021-03-01
Information Sciences
Abstract:<p>Although there exist various adversarial attacking methods, most of them are performed by generating adversarial noises. Inspired by the fact that people usually set different camera parameters to obtain diverse visual styles when taking a picture, we propose the <strong>adversarial attributes</strong>, which generate adversarial examples by manipulating the image attributes like brightness, contrast, sharpness, chroma to simulate the imaging process. This task is accomplished under the black-box setting, where only the predicted probabilities are known. We formulate this process into an optimization problem. After efficiently solving this problem, the optimal adversarial attributes are obtained with limited queries. To guarantee the realistic effect of adversarial examples, we bound the attribute changes using <span class="math"><math>Lp</math></span> norm versus different <em>p</em> values. Besides, we also give a formal explanation for the adversarial attributes based on the linear nature of Deep Neural Networks (DNNs). Extensive experiments are conducted on two public datasets, including CIFAR-10 and ImageNet with respective to four representative DNNs like VGG16, AlexNet, Inception v3 and Resnet50. The results show that at most <span class="math"><math>97.79%</math></span> of images in CIFAR-10 test dataset and <span class="math"><math>98.01%</math></span> of the ImageNet images can be successfully perturbed to at least one wrong class with only <span class="math"><math>⩽</math></span>300 queries per image on average.</p>
computer science, information systems
What problem does this paper attempt to address?