Reading Broadly to Open Your Mind: Improving Open Relation Extraction with Search Documents under Self-Supervisions

Xuming Hu,Zhaochen Hong,Chenwei Zhang,Aiwei Liu,Shiao Meng,Lijie Wen,Irwin King,Philip S. Yu
DOI: https://doi.org/10.1109/tkde.2023.3317139
IF: 9.235
2024-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Open relation extraction is the task of extracting open-domain relation facts from natural language sentences. Existing works either utilize distant-supervised annotations to train a supervised classifier over pre-defined relations, or adopt unsupervised methods with additional dependency on external assumptions. However, these works can only obtain information signals from limited existing knowledge bases or datasets. In this work, we propose a self-supervised framework named Web-SelfORE , which exploits self-supervised signals by requiring a large pretrained language model to extensively read real-world relevant documents from the web, and obtain contextualized relational features by mixing contextualized representations of entities from different documents. We perform adaptive clustering on contextualized relational features and bootstrap the self-supervised signals by improving contextualized features in relation classification. We additionally compare the effectiveness of self-supervisions brought by different document sources, and introduce relevance and redundancy evaluation metrics to obtain higher-quality self-supervisions. Experimental results on four public datasets show the effectiveness and robustness of Web-SelfORE on open-domain relation extraction task when comparing with competitive baselines.
What problem does this paper attempt to address?