High Efficiency Chinese-English Multi-pattern Match Algorithm Based on Network Processor

LIAO Mingtao,ZHANG Deyun,LI Jinku
DOI: https://doi.org/10.3969/j.issn.1000-3428.2007.05.013
2007-01-01
Abstract:(Abstract)Due to different encoding of Chinese and English characters, traditional multi-pattern match algorithms do not work for Chinese- English characters mixed text. This paper proposes a high efficiency multi-pattern match algorithm for Chinese-English mixed environment based on network processor. The algorithm uses left to right positive direction match, considers one byte as least match unit and one character as least shift unit. Based on hashed Trie structure, it utilizes block characters match to reduce the probability of ordered character at a time match, uses Quick Search algorithm to shift and speed up the match. The experiment shows that the algorithm is faster than existing algorithms, avoids byte misplace and mismatch under Chinese-English mixed environment, satisfies the requirement of high performance network information audit system.
What problem does this paper attempt to address?