Research and Design of General Text Processing Method

You SONG,Shi-xing LIANG,Lu HUANG
DOI: https://doi.org/10.3969/j.issn.1000-3428.2010.06.001
2010-01-01
Abstract:A rule is defined to describe the logic of text processing,and an engine is designed to execute the rule,with which text processing is simplified from programming to writing rule.A model of the rule is defined based on XML.The rule includes atom-rules,rule-sets,rule-applications and data contexts.The rule can match text with regular expression,and transform the matched results with escape character and script language.An experiment of extracting Web topic text is given to verify the rationality of the rule and the efficiency of the engine.
What problem does this paper attempt to address?