Hyponym Extraction from the Web by Bootstrapping

Fang Tian, Caixia Yuan, Fuji Ren
DOI: https://doi.org/10.1002/tee.21696
IF: 0.923
2011-01-01
IEEJ Transactions on Electrical and Electronic Engineering
Abstract: This paper proposes an effective method for automatically extracting Chinese hyponyms from the Web. Given a hypernym, the method extracts its hyponyms through weak supervision in two stages. In the first stage, the hypernym and a seed hyponym are submitted as a query to a Web search engine, and hyponyms matching a Chinese doubly anchored hyponymy pattern are extracted automatically from the Web by bootstrapping. To reduce noise during bootstrapping, we propose a set of filtering rules that ensure the proper hypernym is matched in each extracted sentence. In the second stage, all extracted candidate hyponyms are ranked by an integrated ranking algorithm that combines two measures: the linkage frequency between coordinate hyponyms, and the semantic similarity between the hypernym and the candidate hyponym based on co-occurrence statistics. (c) 2011 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
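The two stages can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration rather than the authors' implementation: the English pattern "<hypernym> such as <seed> and X" stands in for the Chinese doubly anchored pattern, the in-memory `CORPUS` list stands in for Web search results, the `COOCCURRENCE` counts are invented toy values, and the weighted sum in `rank` (with the hypothetical parameter `alpha`) is only one plausible way to combine linkage frequency with a co-occurrence score. The paper's filtering rules are not modelled.

```python
import re
from collections import defaultdict, deque

# Toy stand-in for Web search snippets (hypothetical data, not from the paper).
CORPUS = [
    "fruits such as apple and banana are sold here",
    "fruits such as banana and cherry grow well",
    "fruits such as apple and cherry are sweet",
    "fruits such as cherry and banana are cheap",
    "fruits such as cherry and table are odd",  # deliberately noisy snippet
]

# Hypothetical hypernym/candidate co-occurrence counts (assumed values).
COOCCURRENCE = {"banana": 40, "cherry": 30, "table": 2}


def doubly_anchored(hypernym, seed):
    """Build the assumed pattern: both the hypernym and a known hyponym are fixed."""
    return re.compile(
        rf"\b{re.escape(hypernym)} such as {re.escape(seed)} and (\w+)"
    )


def bootstrap(hypernym, seed, corpus):
    """Stage 1: each extracted hyponym becomes the seed of a later query."""
    edges = defaultdict(set)  # candidate -> hyponyms whose pattern extracted it
    seen = {seed}
    queue = deque([seed])
    while queue:
        current = queue.popleft()
        pattern = doubly_anchored(hypernym, current)
        for sentence in corpus:
            for match in pattern.finditer(sentence):
                candidate = match.group(1)
                edges[candidate].add(current)
                if candidate not in seen:
                    seen.add(candidate)
                    queue.append(candidate)
    return edges


def rank(edges, cooccurrence, alpha=0.5):
    """Stage 2: weighted sum of normalized in-degree and co-occurrence count."""
    if not edges:
        return []
    max_in = max(len(sources) for sources in edges.values())
    max_co = max(cooccurrence.get(c, 0) for c in edges) or 1
    scores = {
        c: alpha * len(sources) / max_in
        + (1 - alpha) * cooccurrence.get(c, 0) / max_co
        for c, sources in edges.items()
    }
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda c: (-scores[c], c))


edges = bootstrap("fruits", "apple", CORPUS)
print(rank(edges, COOCCURRENCE))  # ['banana', 'cherry', 'table']
```

In this toy run the noisy candidate "table" is extracted only once and rarely co-occurs with the hypernym, so it ranks last. That is the intuition behind combining the two measures, although the actual scoring in the paper may differ.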