Discovering latent node Information by graph attention network

Weiwei Gu,Fei Gao,Xiaodan Lou,Jiang Zhang
DOI: https://doi.org/10.1038/s41598-021-85826-x
IF: 4.6
2021-03-26
Scientific Reports
Abstract:Abstract In this paper, we propose graph attention based network representation (GANR) which utilizes the graph attention architecture and takes graph structure as the supervised learning information. Compared with node classification based representations, GANR can be used to learn representation for any given graph. GANR is not only capable of learning high quality node representations that achieve a competitive performance on link prediction, network visualization and node classification but it can also extract meaningful attention weights that can be applied in node centrality measuring task. GANR can identify the leading venture capital investors, discover highly cited papers and find the most influential nodes in Susceptible Infected Recovered Model. We conclude that link structures in graphs are not limited on predicting linkage itself, it is capable of revealing latent node information in an unsupervised way once a appropriate learning algorithm, like GANR, is provided.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper attempts to address the problem of how to effectively extract latent information of nodes in graph-structured data and how to generate high-quality node representations through link prediction in the absence of node labels. Specifically, the authors propose a network representation method based on the graph attention mechanism (GANR), aiming to: 1. **Predict missing links**: Utilize link information in the graph structure as supervision information to predict the link probability between unconnected node pairs in the graph. 2. **Reveal hidden node information**: Extract meaningful attention weights by learning node representations, which can be used for node centrality measurement tasks, such as identifying key investors, highly cited papers, and important nodes in disease transmission processes. 3. **Improve the performance of downstream tasks**: The generated node representations not only perform well in link prediction tasks but can also be applied to node classification, network visualization, and community detection tasks, especially in cases where labels are scarce. The main contribution of the paper is that by using link information as supervision information, it extends the application scope of graph attention networks, enabling them to handle more types of network data, such as social networks, biological networks, etc. Additionally, the attention weights proposed by GANR can effectively reflect the importance of nodes, providing a new perspective for network analysis.