Clustering of scientific citations in Wikipedia

Finn Aarup Nielsen
DOI: https://doi.org/10.48550/arXiv.0805.1154
2008-06-12
Abstract:The instances of templates in Wikipedia form an interesting data set of structured information. Here I focus on the cite journal template that is primarily used for citation to articles in scientific journals. These citations can be extracted and analyzed: Non-negative matrix factorization is performed on a (article x journal) matrix resulting in a soft clustering of Wikipedia articles and scientific journals, each cluster more or less representing a scientific topic.
Digital Libraries,Neural and Evolutionary Computing
What problem does this paper attempt to address?