Abstract:Scientific impact plays a central role in the evaluation of the output of scholars, departments, and institutions. A widely used measure of scientific impact is citations, with a growing body of literature focused on predicting the number of citations obtained by any given publication. The effectiveness of such predictions, however, is fundamentally limited by the power-law distribution of citations, whereby publications with few citations are extremely common and publications with many citations are relatively rare. Given this limitation, in this work we instead address a related question asked by many academic researchers in the course of writing a paper, namely: "Will this paper increase my h-index?" Using a real academic dataset with over 1.7 million authors, 2 million papers, and 8 million citation relationships from the premier online academic service ArnetMiner, we formalize a novel scientific impact prediction problem to examine several factors that can drive a paper to increase the primary author's h-index. We find that the researcher's authority on the publication topic and the venue in which the paper is published are crucial factors to the increase of the primary author's h-index, while the topic popularity and the co-authors' h-indices are of surprisingly little relevance. By leveraging relevant factors, we find a greater than 87.5% potential predictability for whether a paper will contribute to an author's h-index within five years. As a further experiment, we generate a self-prediction for this paper, estimating that there is a 76% probability that it will contribute to the h-index of the co-author with the highest current h-index in five years. We conclude that our findings on the quantification of scientific impact can help researchers to expand their influence and more effectively leverage their position of "standing on the shoulders of giants."

Learning to Predict Citation-Based Impact Measures

Modeling Scholarly Collaboration and Temporal Dynamics in Citation Networks for Impact Prediction

Can Scientific Impact Be Predicted?

Deep Representation Learning of Scientific Paper Reveals Its Potential Scholarly Impact

Will This Paper Increase Your h-index? Scientific Impact Prediction

On Modeling and Predicting Individual Paper Citation Count over Time.

Predicting the Citations of Scholarly Paper

Predicting Scientific Impact Through Diffusion, Conformity, and Contribution Disentanglement

Predicting Scholars' Scientific Impact

Long-term Scientific Impact Revisited

Citation count prediction: learning to estimate future citations for literature

Predicting Long-Term Citations from Short-Term Linguistic Influence

Predicting long-term publication impact through a combination of early citations and journal impact factor

Model-based evaluation of scientific impact indicators

The Citation-Based Indicator and Combined Impact Indicator-New Options for Measuring Impact

Analysis of effects to scientific impact indicators based on the coevolution of coauthorship and citation networks

An Early Evaluation of the Long-Term Influence of Academic Papers Based on Machine Learning Algorithms

How to improve the prediction based on citation impact percentiles for years shortly after the publication date?

Predicting citation counts based on deep neural network learning techniques

Modeling and Prediction of the Impact Factor of Journals Using Open-Access Databases

Predicting Patent Citations to measure Economic Impact of Scholarly Research