Learning from Ideography and Labels: A Schema-aware Radical-guided Associative Model for Chinese Text Classification
Hanqing Tao,Guanqi Zhu,Enhong Chen,Shiwei Tong,Kun Zhang,Tong Xu,Qi Liu,Yew-Soon Ong
DOI: https://doi.org/10.1109/tkde.2022.3171690
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Reading psychology believes text comprehension to involve a complex psychological construction process, with the reader mind being a dynamic associative system that stores an abundance of schemata. For Chinese text, in particular, the unique ideographic writing system allows its lansign to trigger semantic association and schema recalling without the need of phonetics. In contrast to previous research efforts on text classification problems, in this paper we present an interdisciplinary modeling approach that draws inspirations from the cognitive principles of ideography, schema theory and deep learning to study Chinese text classification. Specifically, we first propose a Radical-guided Associative Model (RAM) for preliminary cognitive imitation, which comprises two coupled spaces, namely the Literal Space and Associative Space. Then, taking consideration of the schemata acquired from the mind of a reader which plays an important role in influencing text-dependent information revision, we extend RAM with a systematic Schema-aware Radical-guided Associative Model (SRAM) that embeds label semantics as essential text-independent human knowledge for real-world abstraction. In SRAM, the Schema Space is introduced and a Schema Attention module is proposed with a novel loss paradigm that includes the linkage and interaction between text-dependent prior concepts and text-independent label schemata. Extensive experiments on three real-world datasets demonstrate the effectiveness and rationality of our proposed method.
computer science, information systems, artificial intelligence,engineering, electrical & electronic