Multi-Type Web Relation Extraction Based on Bootstrapping

Xiaojiang Liu,Nenghai Yu
DOI: https://doi.org/10.1109/icie.2010.365
2010-01-01
Abstract:Web-scale relation extraction is crucial to building the Web people search engines. Previous extraction models, such as Snowball, focus only on single type extraction, while the real applications always require as many as possible types of relation. In this paper, we propose a novel Web-scale relation extraction framework Multi-Type Snowball (MultiSnowball). MultiSnowball targets at extracting multiple types of relation simultaneously while starts with one pattern. By adopting the general bootstrapping framework, MultiSnowball not only iteratively finds new relation tuples and extraction patterns, but also iteratively identifies new relation types. Patterns are shared during the simultaneous extraction process among all the types to get more relation tuple extractions. Empirical studies on real Web-scale data set show the effectiveness of MultiSnowball over the baseline and Snowball and the ability to identify accurate relation types.
What problem does this paper attempt to address?