Abstract:Textual adversarial attack in black-box scenarios is a challenging task, as only the predicted label is available, and the text space is discrete and non-differentiable. Current research in this area is still in its infancy and mostly focuses on untargeted attack, lacking the capability to control the labels of the generated adversarial examples. Meanwhile, existing textual adversarial attack methods primarily rely on word substitution operations to maintain semantic similarity between the adversarial and original examples, which greatly limits the search space for adversarial examples. To address these issues, we propose a novel Lexical-Syntactic Targeted Adversarial Attack method tailored for the black-box settings, referred to as LST2A. Our approach involves adversarial perturbations at different levels of granularities, i.e., word-level with word substitution operations and syntactic-level through rewriting the syntax of the examples. Specifically, we first embed the entire text into the embedding layer of a masked language model, and then optimize perturbations at the word level within the hidden state to generate adversarial examples with the target label. For examples that are difficult to attack successfully with only word-level perturbations at higher semantic similarity thresholds, we leverage Large Language Model (LLM) to introduce syntactic-level perturbations to these examples, making them more vulnerable to the decision boundary of the victim model. Subsequently, we re-optimize the word-level perturbations for these vulnerable examples. Extensive experiments and human evaluations demonstrate that our proposed method consistently outperforms the state-of-the-art baselines, crafting smoother, more grammatically correct adversarial examples.

An LLM-Enhanced Adversarial Editing System for Lexical Simplification

Multilingual Controllable Transformer-Based Lexical Simplification

Enhancing Pre-trained Language Model with Lexical Simplification

MultiLS: A Multi-task Lexical Simplification Framework

Deep Learning Approaches to Lexical Simplification: A Survey

A Simple BERT-Based Approach for Lexical Simplification

Enhancing Adversarial Resistance in LLMs with Recursion

LST2A: Lexical-Syntactic Targeted Adversarial Attack for Texts

LSBert: A Simple Framework for Lexical Simplification

A Dive into Lexical Simplification with Pre-trained Model

Label Confidence Weighted Learning for Target-level Sentence Simplification

Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications

Chinese Lexical Simplification

Large Language Model Sentinel: LLM Agent for Adversarial Purification

Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement

Evaluating LLMs for Targeted Concept Simplification for Domain-Specific Texts

Multilingual Lexical Simplification via Paraphrase Generation

Large Language Model Sentinel: Advancing Adversarial Robustness by LLM Agent

MultiLS-SP/CA: Lexical Complexity Prediction and Lexical Simplification Resources for Catalan and Spanish

Unsupervised Lexical Simplification with Context Augmentation

An Experimental Study of LSTM Encoder-Decoder Model for Text Simplification