Abstract:How to properly model graphs is a long-existing and important problem in NLP area, where several popular types of graphs are knowledge graphs, semantic graphs and dependency graphs. Comparing with other data structures, such as sequences and trees, graphs are generally more powerful in representing complex correlations among entities. For example, a knowledge graph stores real-word entities (such as "Barack_Obama" and "U.S.") and their relations (such as "live_in" and "lead_by"). Properly encoding a knowledge graph is beneficial to user applications, such as question answering and knowledge discovery. Modeling graphs is also very challenging, probably because graphs usually contain massive and cyclic relations. Recent years have witnessed the success of deep learning, especially RNN-based models, on many NLP problems. Besides, RNNs and their variations have been extensively studied on several graph problems and showed preliminary successes. Despite the successes that have been achieved, RNN-based models suffer from several major drawbacks on graphs. First, they can only consume sequential data, thus linearization is required to serialize input graphs, resulting in the loss of important structural information. Second, the serialization results are usually very long, so it takes a long time for RNNs to encode them. In this thesis, we propose a novel graph neural network, named graph recurrent network (GRN). We study our GRN model on 4 very different tasks, such as machine reading comprehension, relation extraction and machine translation. Some take undirected graphs without edge labels, while the others have directed ones with edge labels. To consider these important differences, we gradually enhance our GRN model, such as further considering edge labels and adding an RNN decoder. Carefully designed experiments show the effectiveness of GRN on all these tasks.

Unleashing the Power of Language Models in Text-Attributed Graph

Unleashing the Power of Language Models in Text-Attributed Graph.

Unsupervised Adversarially-Robust Representation Learning on Graphs

Self-Supervised Node Representation Learning Via Node-to-Neighbourhood Alignment.

NGAT: Attention in Breadth and Depth Exploration for Semi-Supervised Graph Representation Learning

UniGraph: Learning a Unified Cross-Domain Foundation Model for Text-Attributed Graphs

GraphFormers: GNN-nested Language Models for Linked Text Representation

Node Level Graph Autoencoder: Unified Pretraining for Textual Graph Learning

GraphFormers: GNN-nested Transformers for Representation Learning on Textual Graph

Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder

High-Frequency-aware Hierarchical Contrastive Selective Coding for Representation Learning on Text-attributed Graphs

A Pure Transformer Pretraining Framework on Text-attributed Graphs

Pretraining Language Models with Text-Attributed Heterogeneous Graphs

Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning

Unleashing the Power of Emojis in Texts via Self-supervised Graph Pre-Training

Efficient and effective training of language and graph neural network models

Verbalized Graph Representation Learning: A Fully Interpretable Graph Model Based on Large Language Models Throughout the Entire Process

Learning Effective Road Network Representation with Hierarchical Graph Neural Networks

Tackling Graphical NLP problems with Graph Recurrent Networks

WalkLM: A Uniform Language Model Fine-tuning Framework for Attributed Graph Embedding