Document Classification with Spherical Word Vectors

Yiqiao Pan,Chao Xing,Dong Wang
DOI: https://doi.org/10.1109/apsipa.2015.7415518
2015-01-01
Abstract:Recent research shows that low-dimensional continuous representations of words (word vectors) can be successfully employed to classify documents, and document vectors derived from semantic clustering work better than those derived from simple average pooling. On the other hand, our recent study demonstrated that embedding words on a hypersphere offers better performance on tasks including semantic relatedness and bilingual translation when compared to the original approach that embeds words in an unconstrained plane space. In this paper, spherical word vectors are applied to the document classification task. The experiments show that spherical word vectors can deliver good performance when combined with semantic clustering based on vMF distributions.
What problem does this paper attempt to address?