Statistical Analysis for Chinese-English Verb Subcategorization

Tiejun Zhao
2010-01-01
Computer Science
Abstract:Based on large scale Chinese-English parallel corpus,this paper described a systematic experiment of statistical analysis for bilingual verb subcategorization.Firstly,with lexical and grammatical compatibility as heuristics,probabilistic distributions of 654 bilingual subcategorization frames were estimated by means of a two-fold MLE filtering method.Then,linguistic classification of the frames was determined according to Chinese and English syntax.Finally,linguistic classes for each frame were labeled via SVM on the basis of their supporting corpus.
What problem does this paper attempt to address?