Learning Ordered Word Representations

Xiaoxi Wang, Chao Xing, Dong Wang, Rong Liu, Yiqiao Pan
2016-01-01
Correspondence: wxx@cslt.riit.tsinghua.edu.cn, Center for Speech and Language Technology, Research Institute of Information Technology, Tsinghua University, ROOM 1-303, BLDG FIT, 100084 Beijing, China

Abstract: Learning distributed word representations, or word embeddings, has gained much popularity. Current learning approaches treat all dimensions of the embeddings as homogeneous, which leads to non-structured representations where the dimensions are neither interpretable nor comparable. This paper presents ordered word embeddings where the significance of the dimensions is in descending order. The order in dimensions may benefit a wide range of applications such as fast search and vector tailor. Three algorithms are proposed to learn the ordered embeddings, based on dropout, learning rate decay and sparse penalties, respectively. Additionally, a sweeping approach based on the χ distribution is proposed to ensure sufficient training for all dimensions. The experimental results on the WordSimilarity-353 task confirmed that the proposed methods indeed produce ordered embeddings, and better performance can be achieved with ordered representations when compared to the non-ordered counterparts.