GPSAttack: A Unified Glyphs, Phonetics and Semantics Multi-Modal Attack against Chinese Text Classification Models

Yuyao Shao, Liming Wang
DOI: https://doi.org/10.1109/IJCNN55064.2022.9892804
2022-07-18
Abstract: Deep learning models are vulnerable to adversarial examples, which add small perturbations to the original inputs. Such examples can help evaluate the robustness and expose the deficiencies of deep learning models. Current research mainly focuses on adversarial attacks against English text. Because of the differences between Chinese and English, these attack methods cannot be applied directly to Chinese. Meanwhile, in the field of Chinese adversarial text generation, existing methods cannot automatically incorporate the multi-modal characteristics of the Chinese language. Therefore, in this paper, we present GPSAttack, a unified glyphs, phonetics, and semantics multi-modal attack method that can efficiently generate deceitful texts against Chinese text classification models. We conducted exhaustive attack experiments on convolutional, recurrent, and BERT networks to evaluate our method. The results show that our method achieves an attack success rate of up to 90% while modifying fewer than 4 characters per sentence and keeping the text readable to humans.
Computer Science