Abstract:ABSTRACTRepresenting words into vectors in continuous space can form up a potentially powerful basis to generate high-quality textual features for many text mining and natural language processing tasks. Some recent efforts, such as the skip-gram model, have attempted to learn word representations that can capture both syntactic and semantic information among text corpus. However, they still lack the capability of encoding the properties of words and the complex relationships among words very well, since text itself often contains incomplete and ambiguous information. Fortunately, knowledge graphs provide a golden mine for enhancing the quality of learned word representations. In particular, a knowledge graph, usually composed by entities (words, phrases, etc.), relations between entities, and some corresponding meta information, can supply invaluable relational knowledge that encodes the relationship between entities as well as categorical knowledge that encodes the attributes or properties of entities. Hence, in this paper, we introduce a novel framework called RC-NET to leverage both the relational and categorical knowledge to produce word representations of higher quality. Specifically, we build the relational knowledge and the categorical knowledge into two separate regularization functions, and combine both of them with the original objective function of the skip-gram model. By solving this combined optimization problem using back propagation neural networks, we can obtain word representations enhanced by the knowledge graph. Experiments on popular text mining and natural language processing tasks, including analogical reasoning, word similarity, and topic prediction, have all demonstrated that our model can significantly improve the quality of word representations.

Knowledge-Powered Deep Learning for Word Embedding

Incorporating Knowledge into Neural Network for Text Representation.

Knowledge-based Document Embedding for Cross-Domain Text Classification

Predicting Semantically Linkable Knowledge In Developer Online Forums Via Convolutional Neural Network

Learning Effective Word Embedding Using Morphological Word Similarity

Mining Coherent Topics in Documents Using Word Embeddings and Large-Scale Text Data

An Exploration Of Semantic Relations In Neural Word Embeddings Using Extrinsic Knowledge

Co-learning of Word Representations and Morpheme Representations.

KNET: A General Framework for Learning Word Embedding using Morphological Knowledge

Learning Better Word Embedding by Asymmetric Low-Rank Projection of Knowledge Graph

Improved Learning of Word Embeddings with Word Definitions and Semantic Injection.

Knowledge Efficient Deep Learning for Natural Language Processing

Incorporating Linguistic Knowledge for Learning Distributed Word Representations.

Lexical semantics enhanced neural word embeddings

Learning Word Sense Embeddings from Word Sense Definitions

Distilling Word Embeddings: An Encoding Approach

Revisiting Knowledge Base Embedding as Tensor Decomposition

Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings

Improved Learning of Chinese Word Embeddings with Semantic Knowledge.

RC-NET: A General Framework for Incorporating Knowledge into Word Representations.

DeepE: a deep neural network for knowledge graph embedding