Approximate Error Estimation based Incremental Word Representation Learning

Hao Peng
2020-01-01
Abstract:In recent years, neural network based language representation learning model and word embedding technology have been successfully applied in variants of natural language mining tasks. In this paper, we present an incremental hierarchical probabilistic neural language learning algorithm to train evolving word vectors based on hierarchical softmax approximation. We split the incremental word representation learning objective function to reserved term and updated term separately, and factorize the incremental objective function into the hierarchical softmax function. A novel stochastic gradient based approximately incremental method is proposed to update all the inherited word vectors and inherited parameter vectors, to reduce vector errors of inheritance. Theoretical analysis of the bound and convergence is also provided for the approximate incremental objective function. Extensive experiments show that the proposed approximately incremental word embedding method can save a lot of time, and even the maximum acceleration ratio is 30 times. Both word similarity/relatedness tasks, medical domain entity and relation extraction tasks, and temporal word evolution are evaluated as benchmarks for the word correctness and efficiency of the incremental learning word vectors.
What problem does this paper attempt to address?