Multi-strategy text data augmentation for enhanced aspect-based sentiment analysis in resource-limited scenarios
Chuanjun Zhao,Xuzhuang Sun,Rong Feng
DOI: https://doi.org/10.1007/s11227-023-05864-2
IF: 3.3
2024-01-22
The Journal of Supercomputing
Abstract:Aspect-based sentiment analysis (ABSA) constitutes a significant field within natural language processing (NLP). This study proposes a multi-strategy text data augmentation methodology to overcome challenges such as limited dataset sizes and the absence of comprehensive, high-quality corpora in aspect-level sentiment classification (ASC). Specifically, it expands the SemEval 2014 Restaurant and Laptop training datasets from 3017 to 4646 instances and from 1864 to 3693 instances, respectively. The methodology encompasses both word-level and sentence-level augmentations. Evaluations using advanced deep learning techniques, including Att-LSTM, SVM, CNN, and Bi-LSTM, were conducted on the enhanced SemEval 2014 Task 4 restaurant dataset and laptops dataset. The test dataset sizes are 920 and 668, respectively. The Att-LSTM, demonstrating superior performance, recorded F1 score improvements of 5.0% and 4.4% on the restaurant and laptop datasets, respectively, following the application of the multi-strategy augmentation methodology compared to others. This approach significantly enlarges the dataset and improves performance in ASC tasks.
computer science, theory & methods,engineering, electrical & electronic, hardware & architecture