Principles For Comparing Sets Of Documents In Citation Analysis: From Independent Samples To Comparing Sub-Samples In Terms Of Percentile Ranks

Lutz Bornmann,Loet Leydesdorff,Ruediger Mutz,Tobias Opthof
2011-01-01
Abstract:Using citation analysis, sets of documents can be compared as independent samples; for example, in terms of average citation counts using potentially different reference sets. From this perspective, the size of samples matters only for the statistical significance testing of differences and the error estimation. Using the percentile rank approach, differences among citation distributions can be studied in a single scheme. The comparison among the sets reveals that different sizes of the samples affect the weighing of the probabilities and therefore the rankings. We distinguish among (1) the normalization of papers against external reference sets, (2) the normalization in terms of frequencies relative to the margin-totals of independent versus dependent samples, and (3) the potentially normative definition of percentile rank classes for the evaluation (e. g., top-1% most highly cited; median, etc.).
What problem does this paper attempt to address?