Boosting Black-Box Attack to Deep Neural Networks With Conditional Diffusion Models

Renyang Liu,Wei Zhou,Tianwei Zhang,Kangjie Chen,Jun Zhao,Kwok-Yan Lam
DOI: https://doi.org/10.1109/tifs.2024.3390609
IF: 7.231
2024-05-10
IEEE Transactions on Information Forensics and Security
Abstract:Existing black-box attacks have demonstrated promising potential in creating adversarial examples (AE) to deceive deep learning models. Most of these attacks need to handle a vast optimization space and require a large number of queries, hence exhibiting limited practical impacts in real-world scenarios. In this paper, we propose a novel black-box attack strategy, Conditional Diffusion Model Attack (CDMA), to improve the query efficiency of generating AEs under query-limited situations. The key insight of CDMA is to formulate the task of AE synthesis as a distribution transformation problem, i.e., benign examples and their corresponding AEs can be regarded as coming from two distinctive distributions and can transform from each other with a particular converter. Unlike the conventional query-and-optimization approach, we generate eligible AEs with direct conditional transform using the aforementioned data converter, which can significantly reduce the number of queries needed. CDMA adopts the conditional Denoising Diffusion Probabilistic Model as the converter, which can learn the transformation from clean samples to AEs, and ensure the smooth development of perturbed noise resistant to various defense strategies. We demonstrate the effectiveness and efficiency of CDMA by comparing it with nine state-of-the-art black-box attacks across three benchmark datasets. On average, CDMA can reduce the query count to a handful of times; in most cases, the query count is only ONE. We also show that CDMA can obtain >99% attack success rate for untargeted attacks over all datasets and targeted attack over CIFAR-10 with the noise budget of .
computer science, theory & methods,engineering, electrical & electronic
What problem does this paper attempt to address?