Document Classification Via TextCC Based on Stereographic Projection

Zhenya Zhang,Shuguang Zhang,Xufa Wang
DOI: https://doi.org/10.1109/ICMLC.2006.258706
2006-01-01
Abstract:TextCC can classify real documents instantly by cosine similarity. In this paper, stereographic projection is defined from n dimensional real space to the surface of the unit sphere in (n+1) dimensional space. This paper also proposes the relation between the Euclidean distance in n dimensional space and the cosine similarity in (n+1) dimensional real space. To classify documents with represented vectors normalized by stereographic projection, modification on the construction of the weight matrix of hidden layer of TextCC and the fundamental for those modifications are presented. With those modifications, TextCC can classify real documents instantly by Euclidean distance. Experimental results show that TextCC can classify real documents well by Euclidean distance based on stereographic projection
What problem does this paper attempt to address?