Characterizing the highly cited articles: a large-scale bibliometric analysis of the top 1% most cited research

Pablo Dorta-González, Yolanda Santana-Jiménez
DOI: https://doi.org/10.48550/arXiv.1804.10436
2018-04-27
Abstract: We conducted a large-scale analysis of around 10,000 scientific articles, from the period 2007-2016, to study the bibliometric or formal aspects influencing citations. A transversal analysis was conducted disaggregating the articles into more than one hundred scientific areas and two groups, one experimental and one control, each with a random sample of around five thousand documents. The experimental group comprised a random sample of the top 1% most cited articles in each field and year of publication (highly cited articles), and the control group a random sample of the remaining articles in the Journal Citation Reports (science and social science citation indexes in the Web of Science database). As the main result, highly cited articles differ from non-highly cited articles in most of the bibliometric aspects considered. There are significant differences, below the 0.01 level, between the groups of articles in many variables and areas. The highly cited articles are published in journals of higher impact factor (33 percentile points above) and have 25% higher co-authorship. The highly cited articles are also longer in terms of number of pages (10% higher) and bibliographical references (35% more). Finally, highly cited articles have slightly shorter titles (3% lower) but, contrastingly, longer abstracts (10% higher).
Digital Libraries
What problem does this paper attempt to address?
This paper examines how highly cited articles (the top 1% most cited articles in each field and publication year) differ from non-highly cited articles in bibliometric or formal aspects. Specifically, the researchers aim to identify which factors may be associated with high citation counts by analyzing characteristics such as the number of authors, journal impact factor, article length, number of references, and the length of titles and abstracts. The study considers not only whether these factors differ significantly between the two groups but also whether the differences hold across scientific fields.

By comparing two samples (highly cited articles as the experimental group and the remaining articles as the control group) with non-parametric median tests, the paper finds significant differences in several aspects:

- **Journal impact factor**: Highly cited articles are published in journals with a higher impact factor (33 percentile points above).
- **Number of authors**: Highly cited articles have 25% higher co-authorship.
- **Article length**: Highly cited articles have more pages (10% higher).
- **Number of references**: Highly cited articles cite more references (35% more).
- **Title and abstract**: The titles of highly cited articles are slightly shorter (3% lower), but their abstracts are longer (10% higher).

These findings offer insight into the formal characteristics associated with high citation rates and may inform how future research articles are prepared.
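As an illustration of the kind of non-parametric median test mentioned above, the following minimal sketch implements Mood's median test for two independent samples using only the Python standard library. It is not the authors' code: the function name `mood_median_test`, the omission of a continuity correction, and the two lists of reference counts are all assumptions made for this example, and the numbers are invented rather than taken from the paper.

```python
from math import erfc, sqrt
from statistics import median


def mood_median_test(group_a, group_b):
    """Mood's median test for two independent samples.

    Hypothetical helper for illustration; no continuity correction is applied.
    Returns (chi2, p_value, grand_median).
    """
    if not group_a or not group_b:
        raise ValueError("both groups must be non-empty")

    pooled = list(group_a) + list(group_b)
    grand_median = median(pooled)

    # 2x2 table: values above vs. at-or-below the pooled median, per group.
    a_above = sum(1 for x in group_a if x > grand_median)
    b_above = sum(1 for x in group_b if x > grand_median)
    a_below = len(group_a) - a_above
    b_below = len(group_b) - b_above

    total_above = a_above + b_above
    total_below = a_below + b_below
    if total_above == 0 or total_below == 0:
        raise ValueError("test undefined: all values fall on one side of the median")

    # Pearson chi-square statistic for a 2x2 contingency table.
    n = len(pooled)
    chi2 = n * (a_above * b_below - b_above * a_below) ** 2 / (
        total_above * total_below * len(group_a) * len(group_b)
    )

    # Upper-tail probability of a chi-square variable with 1 degree of freedom.
    p_value = erfc(sqrt(chi2 / 2))
    return chi2, p_value, grand_median


# Invented reference counts, for illustration only (not data from the paper).
highly_cited_refs = [52, 61, 47, 70, 58, 66, 49, 73, 55, 64]
control_refs = [31, 40, 28, 45, 36, 50, 33, 42, 38, 29]

chi2, p_value, grand_median = mood_median_test(highly_cited_refs, control_refs)
print(f"pooled median = {grand_median}, chi2 = {chi2:.2f}, p = {p_value:.5f}")
print("significant at the 0.01 level" if p_value < 0.01 else "not significant at the 0.01 level")
```

The test only asks whether the two groups place different shares of their values above the pooled median, so it makes no assumption about the shape of the distributions, which suits skewed bibliometric variables such as reference or page counts.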