A Survey on Learning from Graphs with Heterophily: Recent Advances and Future Directions

Chenghua Gong, Yao Cheng, Jianxiang Yu, Can Xu, Caihua Shan, Siqiang Luo, Xiang Li
2024-09-30
Abstract: Graphs are structured data that model complex relations between real-world entities. Heterophilic graphs, in which linked nodes tend to have different labels or dissimilar features, have recently attracted significant attention and found many real-world applications. Meanwhile, increasing effort has gone into advancing learning from graphs with heterophily, and graph heterophily measures, benchmark datasets, and learning paradigms are emerging rapidly. In this survey, we comprehensively review existing work on learning from graphs with heterophily. First, we give an overview of more than 500 publications, of which more than 340 are directly related to heterophilic graphs. After that, we survey existing metrics of graph heterophily and list recent benchmark datasets. Further, we systematically categorize existing methods using a hierarchical taxonomy that covers GNN models, learning paradigms, and practical applications. In addition, broader topics related to graph heterophily are included. Finally, we discuss the primary challenges of existing studies and highlight promising avenues for future research.
Social and Information Networks, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
The problem this paper attempts to address is how to better understand and handle graph data with heterophily. Specifically, the paper focuses on graph-structured data in which connected nodes tend to have different labels or dissimilar features. Traditional Graph Neural Networks (GNNs) assume that graphs are homophilous, meaning that connected nodes tend to have the same labels or similar features. However, in many real-world applications graph data exhibits significant heterophily, which degrades the performance of traditional GNNs.

The main contributions of the paper include:

1. **Comprehensive Review**: A comprehensive review of existing research on learning from heterophilic graphs.
2. **Systematic Classification**: A hierarchical taxonomy that categorizes existing methods by GNN models, learning paradigms, and practical applications.
3. **Future Outlook**: Identification of the challenges facing current research and proposal of future research directions.

Through these contributions, the paper aims to give researchers a comprehensive framework for understanding, evaluating, and developing learning methods for heterophilic graph data.
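To make the homophily/heterophily distinction concrete, the sketch below computes the edge homophily ratio: the fraction of edges whose two endpoints share a label. This is one commonly used measure, chosen here as an assumption; the summary above does not say which metrics the survey covers. The function name `edge_homophily` and the toy graph are hypothetical and purely illustrative. Values near 1 indicate a homophilous graph, and values near 0 indicate a heterophilic one.

```python
from typing import Hashable, Iterable, Mapping, Tuple


def edge_homophily(
    edges: Iterable[Tuple[Hashable, Hashable]],
    labels: Mapping[Hashable, Hashable],
) -> float:
    """Fraction of edges whose endpoints carry the same label.

    Illustrative helper (not taken from the survey): `edges` is an
    iterable of (u, v) node pairs and `labels` maps each node to its
    class label.
    """
    edge_list = list(edges)
    if not edge_list:
        raise ValueError("edge homophily is undefined for a graph with no edges")
    same_label = sum(1 for u, v in edge_list if labels[u] == labels[v])
    return same_label / len(edge_list)


# Toy example: a 4-cycle whose labels alternate, so every edge joins
# two nodes with different labels (a fully heterophilic graph).
cycle_edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
node_labels = {0: "a", 1: "b", 2: "a", 3: "b"}
h_cycle = edge_homophily(cycle_edges, node_labels)

# Adding the chord (0, 2) introduces one same-label edge out of five.
h_chord = edge_homophily(cycle_edges + [(0, 2)], node_labels)
```

On the toy cycle every edge crosses labels, so the ratio is 0; adding the single same-label chord raises it to 1/5. A message-passing GNN that averages neighbor features would, on the first graph, mix each node only with nodes of the other class, which gives an intuition for why the homophily assumption matters.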