TextAT: Adversarial Training for Natural Language Understanding with Token-Level Perturbation.

Linyang Li,Xipeng Qiu
2020-01-01
Abstract:Gradient-based adversarial training is widely used in improving therobustness of neural networks, while it cannot be easily adapted to naturallanguage processing tasks since the embedding space is discrete. In naturallanguage processing fields, virtual adversarial training is introduced sincetexts are discrete and cannot be perturbed by gradients directly.Alternatively, virtual adversarial training, which generates perturbations onthe embedding space, is introduced in NLP tasks. Despite its success, existingvirtual adversarial training methods generate perturbations roughly constrainedby Frobenius normalization balls. To craft fine-grained perturbations, wepropose a Token-Aware Virtual Adversarial Training method. We introduce atoken-level accumulated perturbation vocabulary to initialize the perturbationsbetter and use a token-level normalization ball to constrain theseperturbations pertinently. Experiments show that our method improves theperformance of pre-trained models such as BERT and ALBERT in various tasks by aconsiderable margin. The proposed method improves the score of the GLUEbenchmark from 78.3 to 80.9 using BERT model and it also enhances theperformance of sequence labeling and text classification tasks.
What problem does this paper attempt to address?