Study on the Pattern Searching for the Chinese Corpus

QIU Bing
2012-01-01
Abstract: Information query is one of the basic functions of a corpus. As corpora become increasingly important for language research, pattern-searching technologies have drawn considerable attention. Current search methods, however, do not sufficiently consider the characteristics of Chinese character encoding or the needs of users. This paper therefore introduces a novel, simplified search expression with only one meta-character, which is intuitive, clear, and easy to use, and which meets the typical demands of information retrieval on a Chinese corpus. Finally, the strategies for translating the novel search expression into regular expressions, and their implementation, are presented.