Radical features for Chinese text classification.

He Hu, Xiaoyong Du
DOI: https://doi.org/10.1109/FSKD.2012.6234029
2012-01-01
Abstract: Chinese radicals play an important role in forming the semantic meaning of Chinese characters. The semantic properties of radicals make them a promising source of information for text mining and content extraction. However, until recently there has been little research on using the radical set in text mining tasks. We investigate the role of radicals in Chinese text classification. In this task, texts are transformed into vectors of radicals, characters, and words. Radicals are further pruned by their semantic strengths and network traits. We carry out experiments with real data from the Open Directory Project. The experimental results support the use of Chinese radicals as important features for semantic processing in Chinese text mining tasks. © 2012 IEEE.
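The transformation of text into radical and character features described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `CHAR_TO_RADICAL` table is a hypothetical toy lookup, the `c:` and `r:` feature prefixes and the function name are assumptions made for illustration, and word features and the pruning by semantic strength and network traits are omitted because the abstract does not specify them.

```python
from collections import Counter

# Hypothetical toy lookup from character to radical. A real system would
# load a complete character-to-radical table from a dictionary resource.
CHAR_TO_RADICAL = {
    "河": "氵",
    "湖": "氵",
    "海": "氵",
    "树": "木",
    "林": "木",
    "松": "木",
}


def radical_char_features(text):
    """Count character ('c:') and radical ('r:') features in a text."""
    features = Counter()
    for ch in text:
        if ch.isspace():
            continue
        features["c:" + ch] += 1
        radical = CHAR_TO_RADICAL.get(ch)
        if radical is not None:
            # Characters sharing a radical contribute to the same feature.
            features["r:" + radical] += 1
    return features


features = radical_char_features("河海树林")
```

In this sketch, four distinct characters collapse onto two radical features, which suggests how radicals could give a classifier a coarser, semantically grouped signal alongside character counts.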