Hybrid Pronominal Anaphora Resolution System For Mandarin

Lin Chen, Minghu Jiang, Guoying Huang
2007-01-01
Abstract: In this paper, we formalize pronominal anaphora resolution as a classification problem. Working with the Chinese Treebank corpus, we propose a hybrid approach that combines rule-based and statistical methods. We extract features from semantic information and simple statistics, and use collocation to improve the feature extraction procedure. We extract anaphors and their antecedent candidates from the corpus to construct the training and test datasets. A Support Vector Machine (SVM) is employed to solve the classification problem, and the anaphora resolution result is obtained from the SVM's predictions. The system achieves an accuracy of 91.36%, which we consider a satisfactory result.
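The classification framing in the abstract can be illustrated with a minimal mention-pair sketch. This is not the authors' implementation: the `Mention` fields, the three features, and the `toy_score` function are all hypothetical stand-ins chosen for illustration, and the abstract does not specify the actual feature set. In the paper's setup, a trained SVM's decision function would play the role that `toy_score` plays here.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Mention:
    """A noun phrase or pronoun with hypothetical annotations."""
    text: str
    sentence: int   # index of the sentence containing the mention
    position: int   # token offset within the document
    gender: str     # "m", "f", or "unknown"
    number: str     # "sg", "pl", or "unknown"


def agree(a: str, b: str) -> float:
    """Return 1.0 if two attribute values are compatible, else 0.0."""
    return 1.0 if "unknown" in (a, b) or a == b else 0.0


def pair_features(anaphor: Mention, candidate: Mention) -> Dict[str, float]:
    """Illustrative features for one (anaphor, candidate) pair."""
    return {
        "sentence_distance": float(anaphor.sentence - candidate.sentence),
        "gender_agree": agree(anaphor.gender, candidate.gender),
        "number_agree": agree(anaphor.number, candidate.number),
    }


def build_instances(
    anaphor: Mention, candidates: List[Mention], gold: Mention
) -> List[Tuple[Dict[str, float], int]]:
    """Turn one annotated anaphor into labeled classification instances.

    Each preceding candidate yields one instance, labeled 1 if it is the
    annotated antecedent and 0 otherwise.
    """
    return [
        (pair_features(anaphor, c), 1 if c == gold else 0)
        for c in candidates
        if c.position < anaphor.position
    ]


def resolve(
    anaphor: Mention,
    candidates: List[Mention],
    score: Callable[[Dict[str, float]], float],
) -> Optional[Mention]:
    """Pick the preceding candidate with the highest classifier score."""
    preceding = [c for c in candidates if c.position < anaphor.position]
    if not preceding:
        return None
    return max(preceding, key=lambda c: score(pair_features(anaphor, c)))


def toy_score(features: Dict[str, float]) -> float:
    """Hand-set linear score; a stand-in for a trained SVM decision function."""
    return (
        features["gender_agree"]
        + features["number_agree"]
        - 0.1 * features["sentence_distance"]
    )


# Toy sentence: 小明 告诉 小红 他 迟到 了 ("Xiao Ming told Xiao Hong he was late")
xiaoming = Mention("小明", sentence=0, position=0, gender="m", number="sg")
xiaohong = Mention("小红", sentence=0, position=2, gender="f", number="sg")
ta = Mention("他", sentence=0, position=3, gender="m", number="sg")

instances = build_instances(ta, [xiaoming, xiaohong], gold=xiaoming)
antecedent = resolve(ta, [xiaoming, xiaohong], toy_score)
```

Framing each anaphor-candidate pair as one labeled instance is what lets a binary classifier such as an SVM be applied directly; at resolution time the highest-scoring candidate is taken as the antecedent.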