Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web

Yulan Yan, Naoaki Okazaki, Yutaka Matsuo, Zhenglu Yang, Mitsuru Ishizuka
DOI: https://doi.org/10.3115/1690219.1690289
2009-01-01
Abstract: This paper presents an unsupervised relation extraction method for discovering and enhancing relations in which a specified concept in Wikipedia participates. Using the respective characteristics of Wikipedia articles and the Web corpus, we develop a clustering approach based on combinations of patterns: dependency patterns from dependency analysis of texts in Wikipedia, and surface patterns generated from highly redundant information related to the Web. Evaluations of the proposed approach on two different domains demonstrate the superiority of the pattern combination over existing approaches. Fundamentally, our method demonstrates how deep linguistic patterns and Web surface patterns complement each other in the generation of various relations.