An unsupervised approach for noun resolution

Yuhang Yang, Tiejun Zhao, Dequan Zheng, Hao Yu
2009-01-01
Journal of Information and Computational Science
Abstract: Most existing coreference resolution techniques focus on pronoun resolution within the same document. This paper presents an unsupervised approach to noun resolution across different documents. Given two raw corpora, one from the general domain and one from an application domain, domain-specific terms are extracted based on how strings are distributed across the two domains. Noun coreference is then resolved by a similarity calculation that uses both pronunciation and context information: each extracted term is compared with the given entities to decide whether it is coreferential and, if so, with which entity. The proposed approach requires no training, no prior domain knowledge, and no manually annotated corpora. It is applicable to any domain corpus and is especially useful for knowledge-limited and resource-limited domains. Preliminary experiments on noun resolution in the sports domain achieve good performance. 1548-7741/ Copyright © 2009 Binary Information Press.
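The two stages described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption made for illustration rather than the authors' method: the smoothed frequency-ratio score, the `min_ratio` and `threshold` values, the weight `alpha`, and all function names are hypothetical, and plain string similarity from `difflib` stands in for the pronunciation similarity the paper mentions.

```python
import math
from collections import Counter
from difflib import SequenceMatcher


def domain_terms(domain_tokens, general_tokens, min_ratio=2.0, min_count=2):
    """Return tokens that are relatively more frequent in the domain corpus.

    Assumption: a smoothed relative-frequency ratio is used as the
    "distribution information"; the paper may use a different statistic.
    """
    d, g = Counter(domain_tokens), Counter(general_tokens)
    nd, ng = len(domain_tokens), len(general_tokens)
    vocab = len(set(d) | set(g))
    terms = []
    for tok, count in d.items():
        if count < min_count:
            continue
        # Add-one smoothing so tokens unseen in the general corpus do not divide by zero.
        p_domain = (count + 1) / (nd + vocab)
        p_general = (g[tok] + 1) / (ng + vocab)
        if p_domain / p_general >= min_ratio:
            terms.append(tok)
    return sorted(terms)


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity(term, term_ctx, entity, entity_ctx, alpha=0.5):
    """Weighted mix of surface similarity and context similarity.

    Assumption: character-level similarity is a stand-in for pronunciation.
    """
    surface = SequenceMatcher(None, term.lower(), entity.lower()).ratio()
    context = cosine(Counter(term_ctx), Counter(entity_ctx))
    return alpha * surface + (1 - alpha) * context


def resolve(term, term_ctx, entities, threshold=0.5):
    """Return the best-matching entity name, or None if no score reaches the threshold.

    `entities` maps each entity name to a list of its context words.
    """
    best, best_score = None, 0.0
    for name, ctx in entities.items():
        score = similarity(term, term_ctx, name, ctx, alpha=0.5)
        if score > best_score:
            best, best_score = name, score
    return best if best_score >= threshold else None
```

In this sketch, `domain_terms` would be run once over the two tokenized corpora, and `resolve` would then be called for each extracted term together with the words that surround it. A real system would replace the string-similarity stand-in with a phonetic comparison suited to the language of the corpus.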