A Survey of Link Prediction Algorithms

Vivian Feng
2023-06-14
Abstract: The problem of link prediction, predicting if two nodes in a network have a connection between them, is a theoretical problem with numerous field-agnostic real-world applications. This paper investigates the efficacy of three classes of link prediction algorithms: local node similarity heuristics, the global index Random Walk with Restart, and Node2Vec embeddings. Furthermore, this paper provides insight into the performance of canonical link prediction algorithms on small graphs. The graphs included in this study are sampled from various domains, including infrastructure and ecological networks.
Social and Information Networks
What problem does this paper attempt to address?
This paper explores the problem of link prediction in networks: predicting whether a connection exists between two nodes. Specifically, it examines the effectiveness of three classes of link prediction algorithms:

1. **Local Similarity Indices**: These methods evaluate the likelihood of a potential link based on the number of neighbors the two nodes share.
2. **Random Walk with Restart (RWR) Global Index**: This method scores candidate links by examining the connectivity of the entire network.
3. **Node2Vec Embedding**: This is a machine learning technique that generates low-dimensional representations of nodes.

The paper also analyzes the performance of these classic link prediction algorithms on small graphs, testing them on multiple graph datasets from different domains, including infrastructure and ecological networks. Additionally, it discusses the impact of different hyperparameters on algorithm performance and suggests directions for further research.
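To make the first two algorithm classes more concrete, here is a minimal Python sketch of two local similarity indices (common neighbors and Jaccard) and of Random Walk with Restart computed by power iteration. It is an illustration under stated assumptions, not the paper's implementation: the toy graph, the function names, the restart probability of 0.15, and the fixed iteration count are all hypothetical choices made for this example.

```python
def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)}."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def common_neighbors(adj, u, v):
    """Local index: number of neighbors shared by u and v."""
    return len(adj[u] & adj[v])


def jaccard(adj, u, v):
    """Local index: shared neighbors divided by the size of the neighborhood union."""
    union = adj[u] | adj[v]
    return len(adj[u] & adj[v]) / len(union) if union else 0.0


def rwr_scores(adj, seed, restart=0.15, iters=100):
    """Global index: visiting probabilities of a random walk that starts at
    `seed` and jumps back to it with probability `restart` at every step.
    restart=0.15 and iters=100 are illustrative assumptions."""
    p = {n: (1.0 if n == seed else 0.0) for n in adj}
    for _ in range(iters):
        nxt = {n: (restart if n == seed else 0.0) for n in adj}
        for n, neighbors in adj.items():
            if neighbors:
                # Spread the non-restarting mass evenly over the neighbors.
                share = (1.0 - restart) * p[n] / len(neighbors)
                for m in neighbors:
                    nxt[m] += share
            else:
                # Isolated node: send its mass back to the seed.
                nxt[seed] += (1.0 - restart) * p[n]
        p = nxt
    return p


# Hypothetical toy graph, used only for illustration.
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("b", "d"), ("c", "d"), ("d", "e")]
adj = build_adjacency(edges)
scores_from_a = rwr_scores(adj, "a")
```

On this toy graph the candidate link (a, d) has two common neighbors, b and c, while (a, e) has none, so both local indices rank (a, d) higher. The RWR scores agree: a walk restarting at a visits d more often than e. One common convention, assumed here rather than taken from the paper, is to score a candidate pair symmetrically as `rwr_scores(adj, u)[v] + rwr_scores(adj, v)[u]`.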