Abstract:Recognizing entity synonyms from text has become a crucial task in many entity-leveraging applications. However, discovering entity synonyms from domain-specific text corpora (e.g., news articles, scientific papers) is rather challenging. Current systems take an entity name string as input to find out other names that are synonymous, ignoring the fact that often times a name string can refer to multiple entities (e.g., "apple" could refer to both Apple Inc and the fruit apple). Moreover, most existing methods require training data manually created by domain experts to construct supervised-learning systems. In this paper, we study the problem of automatic synonym discovery with knowledge bases, that is, identifying synonyms for knowledge base entities in a given domain-specific corpus. The manually-curated synonyms for each entity stored in a knowledge base not only form a set of name strings to disambiguate the meaning for each other, but also can serve as "distant" supervision to help determine important features for the task. We propose a novel framework, called DPE, to integrate two kinds of mutually-complementing signals for synonym discovery, i.e., distributional features based on corpus-level statistics and textual patterns based on local contexts. In particular, DPE jointly optimizes the two kinds of signals in conjunction with distant supervision, so that they can mutually enhance each other in the training stage. At the inference stage, both signals will be utilized to discover synonyms for the given entities. Experimental results prove the effectiveness of the proposed framework.

Exploiting Multiple Sources for Open-Domain Hypernym Discovery.

Reserch of Entity Matching Based on Multiple Heterogenous Data

Detecting Hypernymy Relations Between Medical Compound Entities Using a Hybrid-Attention Based Bi-GRU-CapsNet Model.

Hyponym Extraction from the Web by Bootstrapping

Entity Synonym Discovery via Multipiece Bilateral Context Matching

Chinese Hypernym-Hyponym Extraction from User Generated Categories.

Automatic Synonym Discovery with Knowledge Bases

Exploiting Collective Hidden Structures In Webpage Titles For Open Domain Entity Extraction

Transductive Non-linear Learning for Chinese Hypernym Prediction

Self-Supervised Synonym Extraction from the Web *

Multi-Distribution Characteristics Based Chinese Entity Synonym Extraction from The Web

Verification Based on Hyponymy Hierarchical Characteristics for Web-Based Hyponymy Discovery.

A Multi-strategy Approach to Chinese Open Relation Extraction

Predicting Hypernym–hyponym Relations for Chinese Taxonomy Learning

Extracting Hyponymy Relation Between Chinese Terms

Open Domain Chinese Triples Hierarchical Extraction Method

A Hybrid Method for Entity Hyponymy Acquisition in Chinese Complex Sentences.

Learning Term Embeddings for Hypernymy Identification.

Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction

Three Heads Are Better Than One: Improving Cross-Domain NER with Progressive Decomposed Network

Entity Synonym Discovery Via Multiple Attentions