Research of an Improved Double-Array Trie Tree Algorithm

Qing Yi Qu,Wen Chuan Yang,Bei Jia
DOI: https://doi.org/10.4028/WWW.SCIENTIFIC.NET/AMR.945-949.2401
2014-06-01
Abstract:This paper presents an improved algorithm-iDAT, which is based on Double-Array Trie Tree for Chinese Word Segmentation Dictionary. Chinese word segmentation dictionary based on the Double-Array Trie Tree has higher efficiency of search, but the dynamic insertion will consume a lot of time. After initialization the original dictionary. We implement a Hash process to the empty sequence index values for base array. The final Hash table stores the sum of the empty sequence before the current empty sequence. This algorithm adopt Sunday jumps algorithm of Single Pattern Matching. With slightly and reasonable space cost increasing, iDAT reduces the average time complexity of the dynamic insertion process in Trie Tree. Practical results shows it has a good operation performance.
Computer Science
What problem does this paper attempt to address?