Research on double-array-trie tree-based lexicon and its application on micro-blog content analysing

Dong-Ru Ruan,Radha Ganesan,Kai Gao,Er-Liang Zhou
DOI: https://doi.org/10.1504/IJCAT.2015.073594
2015-12-01
Abstract:This paper presents a novel algorithm on double-array-trie tree-based lexicon construction and its corresponding application on Chinese segmentation. Compared with the traditional hash and binary-based algorithms, the proposed approach can enhance the space utilisation and the retrieval efficiency so as to minimise the unnecessary comparison. This paper also presents its application both on micro-blog content analysis and the public opinion discovery. The experimental result and the application show the feasibility of the approach, and the existing problems and the future works are also presented.
Computer Science
What problem does this paper attempt to address?