Multi-schema prompting powered token-feature woven attention network for short text classification

Zijing Cai,Hua Zhang,Peiqian Zhan,Xiaohui Jia,Yongjian Yan,Xiawen Song,Bo Xie
DOI: https://doi.org/10.1016/j.patcog.2024.110782
IF: 8
2024-07-19
Pattern Recognition
Abstract:Short text classification task poses challenges in natural language processing due to insufficient contextual information. This task is typically approached by extracting rich semantic features in the text and encoding it as a sentence-level representation using deep neural networks. The self-attention mechanism has emerged as one of the primary methods to tackle this problem. However, traditional attention methods only focus on the interactions between tokens, neglecting the semantic relationships between features. We propose a novel attention-based module, called token-feature woven attention fusion (TFWAF) network for sentence-level representation information aggregation, which leverages the self-attention mechanism from both token and feature perspectives. Moreover, we design a multi-schema prompting approach within machine reading comprehension and prompt learning paradigms to better utilize prior knowledge in a pre-trained language model and recognize enhanced textual semantic representation. Experimental results show our model achieves state-of-the-art performance compared to existing baselines on eight benchmark datasets in the context of short text classification. The source code is available in https://github.com/Aaronzijingcai/MP-TFWA .
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?