SPINE: SParse Interpretable Neural Embeddings

Anant Subramanian,Danish Pruthi,Harsh Jhamtani,Taylor Berg-Kirkpatrick,Eduard Hovy
DOI: https://doi.org/10.48550/arXiv.1711.08792
2017-11-24
Abstract:Prediction without justification has limited utility. Much of the success of neural models can be attributed to their ability to learn rich, dense and expressive representations. While these representations capture the underlying complexity and latent trends in the data, they are far from being interpretable. We propose a novel variant of denoising k-sparse autoencoders that generates highly efficient and interpretable distributed word representations (word embeddings), beginning with existing word representations from state-of-the-art methods like GloVe and word2vec. Through large scale human evaluation, we report that our resulting word embedddings are much more interpretable than the original GloVe and word2vec embeddings. Moreover, our embeddings outperform existing popular word embeddings on a diverse suite of benchmark downstream tasks.
Computation and Language
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is that although existing word embedding representations (such as GloVe and word2vec) are very effective in capturing the underlying complexity and trends of data, these representations are often dense and difficult to interpret. Specifically, it is very difficult for humans to understand what a high or low value on a certain dimension means for a word vector. To solve this problem, the author proposes a new method based on the denoising k - sparse auto - encoder, called SPINE (Sparse Interpretable Neural Embeddings), for generating efficient and interpretable distributed word representations (word embeddings). Through this method, the author aims to improve the interpretability of word embeddings while maintaining or even enhancing their performance in various downstream tasks. In the paper, it is reported through large - scale human evaluations that the word embeddings generated by SPINE are more interpretable than the original GloVe and word2vec embeddings and perform well in a series of benchmark downstream tasks.