Graph Neural Networks with Composite Kernels

Yufan Zhou,Jiayi Xian,Changyou Chen,Jinhui Xu
DOI: https://doi.org/10.48550/arXiv.2005.07869
2020-05-16
Abstract:Learning on graph structured data has drawn increasing interest in recent years. Frameworks like Graph Convolutional Networks (GCNs) have demonstrated their ability to capture structural information and obtain good performance in various tasks. In these frameworks, node aggregation schemes are typically used to capture structural information: a node's feature vector is recursively computed by aggregating features of its neighboring nodes. However, most of aggregation schemes treat all connections in a graph equally, ignoring node feature similarities. In this paper, we re-interpret node aggregation from the perspective of kernel weighting, and present a framework to consider feature similarity in an aggregation scheme. Specifically, we show that normalized adjacency matrix is equivalent to a neighbor-based kernel matrix in a Krein Space. We then propose feature aggregation as the composition of the original neighbor-based kernel and a learnable kernel to encode feature similarities in a feature space. We further show how the proposed method can be extended to Graph Attention Network (GAT). Experimental results demonstrate better performance of our proposed framework in several real-world applications.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that when performing node classification on graph - structured data, existing graph neural networks (such as GCN and GAT) lack consideration of node feature similarity when aggregating neighbor node features. Specifically, most aggregation schemes treat connections in the graph equally and ignore the similarity of node features. This treatment limits the flexibility and expressiveness of the model, especially in cases where different nodes need to be distinguished according to node features. To overcome this limitation, the author proposes a framework based on composite kernels, which can encode the similarity of node features into the aggregation process. By reinterpreting the aggregation operation in the standard GCN as kernel - weighted aggregation and introducing a new learnable kernel function to encode the similarity in the feature space, the author constructs a more flexible aggregation scheme. In addition, this framework can also be extended to graph attention networks (GAT), thus showing better performance in multiple practical applications. The main contributions of the paper include: 1. Reinterpreting the aggregation operation in the standard GCN from the perspective of kernel - weighted and proving that the normalized adjacency matrix is actually a kernel matrix related to the Reproducing Kernel Krein Space. 2. Proposing a composite kernel construction framework and defining a feature - sensitive aggregation method that can be easily applied to different models such as GCN and GAT. 3. Providing a new method to learn positive semi - definite kernels that encode feature similarity and providing corresponding theoretical analysis and explanation. 4. Experimental results show that the proposed framework has better performance on multiple tasks compared to the standard GCN, GAT and other related models. Through these improvements, the paper aims to improve the flexibility and performance of graph neural networks when processing graph - structured data, especially in application scenarios where node feature similarity needs to be considered.