AdHierNet: Enhancing Adversarial Robustness and Interpretability in Text Classification

Kai Chen,Yingping Deng,Qingcai Chen,Dongfeng Li
DOI: https://doi.org/10.1109/icnlp60986.2024.10692972
2024-01-01
Abstract:In the realm of deep learning-based text classification, the challenge of balancing high-performance with enhanced adversarial robustness and model interpretability is considerable. Models must be stable against subtle input variations and articulate their decision-making processes clearly. This study introduces the Adversarial Hierarchical Network (AdHierNet), which combines adversarial training with a multilayered graph neural network approach to improve adversarial robustness and interpretability in text classification. AdHierNet integrates diverse graph neural network methodologies for complex text data processing and employs BertSum for thorough text management and essential information extraction. Evaluated on the SemEval-2017 Task5 and AESW datasets, AdHierNet surpasses traditional models, particularly in sub-clause level recall. This research not only presents a novel approach to text classification but also contributes insights into enhancing neural networks' adversarial robustness and interpretability. Furthermore, it underscores the effective processing of extended texts, extraction of critical sub-clauses, and achievement of these objectives within a multi-task learning framework.
What problem does this paper attempt to address?