Researches on Large Scale Corpus-Based Syntactic Pattern Matching

ZHANG Liang,CHEN Jia-jun
DOI: https://doi.org/10.3969/j.issn.1003-0077.2007.05.006
2007-01-01
Abstract:Based on a large amount of rightly parsed examples with which both parsing procedures and parsing results are recorded,syntactic parsing can be carried out by searching similar example or fragment,and matching similar language structure and analysis in the examples.This embodies the assumption that human language perception and production work with representations of concrete language experiences,rather than with abstract grammar rules.In this paper,we propose a new parsing technique based on syntactic pattern matching.We extract syntactic patterns from a large-scale tree bank,and establish a library of syntactic patterns/sub-patterns and corresponding reduction procedures beforehand.Parsing tasks are fulfilled by pattern matching and partial pattern transforming.The experiments show that the parsing results are satisfying and the program execution speed is very high,achieving 0.46s/per sentence on average(CPU: Intel Core Duo 2.8G,Memory:1G).
What problem does this paper attempt to address?