Transferable Adversarial Examples Can Efficiently Fool Topic Models

Zhen Wang,Yitao Zheng,Hai Zhu,Chang Yang,Tianyi Chen
DOI: https://doi.org/10.1016/j.cose.2022.102749
IF: 5.105
2022-01-01
Computers & Security
Abstract:Intelligent text systems have been shown vulnerable to adversarial attacks. A popular way to enhance the robustness is to leverage transferable adversarial examples. As one of the most important statistical models, topic model is widely used in various applications. However, whether adversarial examples exhibit good transferability across topic models has never been explored. In this paper, we propose an effective adversarial example generator TopicAttack to disturb the inference of a target victim topic model and an ensemble algorithm TopicAttack+ to further increase transferability. TopicAttack exploits the words of high importance and constructs adversarial examples via synonym transformation. TopicAttack+ searches the optimal model ensemble given a set of prescribed substitutes, then generate adversarial examples of stronger transferability. Numerically, we demonstrate the effectiveness of the proposed method on the benchmark datasets AP, NIPS and 20News. In particular, given eight benchmark substitute models, TopicAttack disturbs them all effectively by achieving high Kullback-Leibler (KL) divergence; and TopicAttack+ significantly improves the average transferability performance on NIPS from 1.308 to 1.695.
What problem does this paper attempt to address?