Leveraging Transferability and Improved Beam Search in Textual Adversarial Attacks

Bin Zhu,Zhaoquan Gu,Yaguan Qian,Francis Lau,Zhihong Tian
DOI: https://doi.org/10.1016/j.neucom.2022.05.054
IF: 6
2022-01-01
Neurocomputing
Abstract:Adversarial attacks in NLP are difficult to ward off because of the discrete and highly abstract nature of human languages. Prior works utilize different word replacement strategies to generate semantic-preserving adversarial texts. These query-based methods, however, have limited exploration of the search space. To fully explore the search space, an improved beam search with multiple random perturbing positions is used. Besides, we use the transferable vulnerability from surrogate models to choose vulnerable candidate words for target models. We empirically show that beam search with multiple random attacking positions works better than the commonly used greedy search with word importance ranking. Extensive experiments on three popular datasets demonstrate that our method can outperform three advanced attacking methods under black-box settings. We provide ablation studies to clearly show the effectiveness of our improved beam search which can achieve a higher success rate than the greedy approach under the same query budget.
What problem does this paper attempt to address?