Abstract:Definition Extraction (DE) is one of the well-known topics in Information Extraction that aims to identify terms and their corresponding definitions in unstructured texts. This task can be formalized either as a sentence classification task (i.e., containing term-definition pairs or not) or a sequential labeling task (i.e., identifying the boundaries of the terms and definitions). The previous works for DE have only focused on one of the two approaches, failing to model the inter-dependencies between the two tasks. In this work, we propose a novel model for DE that simultaneously performs the two tasks in a single framework to benefit from their inter-dependencies. Our model features deep learning architectures to exploit the global structures of the input sentences as well as the semantic consistencies between the terms and the definitions, thereby improving the quality of the representation vectors for DE. Besides the joint inference between sentence classification and sequential labeling, the proposed model is fundamentally different from the prior work for DE in that the prior work has only employed the local structures of the input sentences (i.e., word-to-word relations), and not yet considered the semantic consistencies between terms and definitions. In order to implement these novel ideas, our model presents a multi-task learning framework that employs graph convolutional neural networks and predicts the dependency paths between the terms and the definitions. We also seek to enforce the consistency between the representations of the terms and definitions both globally (i.e., increasing semantic consistency between the representations of the entire sentences and the terms/definitions) and locally (i.e., promoting the similarity between the representations of the terms and the definitions). The extensive experiments on three benchmark datasets demonstrate the effectiveness of our approach.1

Multi-sense Definition Modeling using Word Sense Decompositions

Do Multi-Sense Embeddings Improve Natural Language Understanding?

Definition Modeling: Learning to define word embeddings in natural language

Exploiting Correlations Between Contexts and Definitions with Multiple Definition Modeling

On Modeling Sense Relatedness in Multi-prototype Word Embedding.

Improving interpretability of word embeddings by generating definition and usage

Real Multi-Sense or Pseudo Multi-Sense: an Approach to Improve Word Representation

Learning Word Sense Embeddings from Word Sense Definitions

A Unified Model for Word Sense Representation and Disambiguation.

Constructing High Quality Sense-specific Corpus and Word Embedding Via Unsupervised Elimination of Pseudo Multi-sense.

Understanding and Improving Multi-Sense Word Embeddings via Extended Robust Principal Component Analysis

Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation

Addressing the Polysemy Problem in Language Modeling with Attentional Multi-Sense Embeddings

Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context

xSense: Learning Sense-Separated Sparse Representations and Textual Definitions for Explainable Word Sense Networks

Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation Incorporating Gloss Information

Chinese Word Sense Embedding with SememeWSD and Synonym Set

A Probabilistic Model for Learning Multi-Prototype Word Embeddings.

A Joint Model for Definition Extraction with Syntactic Connection and Semantic Consistency

VCDM: Leveraging Variational Bi-encoding and Deep Contextualized Word Representations for Improved Definition Modeling

Leveraging Human Prior Knowledge to Learn Sense Representations