A GAT-Based Chinese Text Classification Model: Using of Redical Guidance and Association Between Characters Across Sentences.

Yizhao Wang,Yuncheng Jiang
DOI: https://doi.org/10.1007/978-3-031-10986-7_15
2022-01-01
Abstract:Cognitive psychology research has shown that humans tend to use objects in the abstract real world as cognition. For the Chinese in particular, the first thing to emerge from the process of writing formation is pictographs. This feature is very useful for Chinese text classification. Fortunately, many basic and extended radical-related systems are also included in all Chinese dictionaries. Moreover, for Chinese texts the graph attention structure can better record the information of features in Chinese texts. To this end, we propose a GAT-based Chinese text classification model considering character, word, sentence position information, character position information and radicals as nodes in Chinese text. The model is based on Redical guidance and association between Characters across Sentences (RCS). In order to better exploit the common features of Chinese characters and sequence features in text, this paper designs a model with context-awareness, which is capable of collecting information at a distance. Finally, we conduct extensive experiments, and the experimental results not only prove the superiority of our model, but also verify the effectiveness of the radical and GAT model in Chinese text classification tasks.
What problem does this paper attempt to address?