Heuristics for Inequality minimization in PageRank values

Subhajit Sahu
2023-10-28
Abstract:This research study investigates the minimization of inequality in the ranks of vertices obtained using the PageRank algorithm. PageRank is a widely used algorithm for ranking webpages and plays a significant role in determining web traffic. This study employs the Gini coefficient, a measure of income/wealth inequality, to assess the inequality in PageRank distributions on various types of graphs. The investigation involves two experiments: one that modifies strategies for handling dead-end nodes and another that explores six deterministic methods for reducing inequality. Our findings indicate that a combination of two distinct heuristics may present an effective strategy for minimizing inequality.
Computers and Society,Social and Information Networks
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to minimize the inequality in node rankings when using the PageRank algorithm to rank web pages. PageRank is an algorithm widely used in web page ranking. It determines the importance of web pages by evaluating the link relationships between web pages. However, this algorithm may cause already popular web pages to receive more traffic, thus exacerbating the inequality in rankings. To address this problem, this paper studies in different types of graphs, by modifying the strategy for dealing with dead - end nodes and exploring six deterministic methods to reduce the degree of inequality in the PageRank value distribution, specifically using the Gini coefficient as an indicator to measure inequality. The study found that combining two different heuristic methods may be an effective strategy to reduce inequality.