Abstract: We present a novel graph classification formulation for inferring gene regulatory relationships from gene expression and graph embedding. Our method leverages the gene regulatory graph neural network (GRGNN), a flexible framework that ensembles the statistical power of a number of heuristic skeletons. Our results show that GRGNN outperforms previous supervised and unsupervised methods inductively on benchmarks. GRGNN can be interpreted and explained following the biological network motif hypothesis in gene regulatory networks.

Discovering gene regulatory relationships and reconstructing gene regulatory networks (GRNs) from gene expression data is a classical, long-standing computational challenge in bioinformatics. Computationally inferring a possible regulatory relationship between two genes can be formulated as a link prediction problem between two nodes in a graph. Graph neural networks (GNNs) provide an opportunity to construct GRNs by integrating topological neighbor propagation through the whole gene network. We propose an end-to-end gene regulatory graph neural network (GRGNN) approach to reconstruct GRNs from scratch using gene expression data, in both a supervised and a semi-supervised framework. To achieve better inductive generalization, GRN inference is formulated as a graph classification problem: distinguishing whether a subgraph centered at two nodes contains the link between the two nodes. A linked transcription factor (TF) and target gene pair, together with their neighbors, is labeled as a positive subgraph, while an unlinked TF and target gene pair, together with their neighbors, is labeled as a negative subgraph. A GNN model is constructed with node features from both explicit gene expression and graph embedding.
We demonstrate that a noisy starting graph structure built from partial information, such as Pearson's correlation coefficient and mutual information, can help guide GRN inference through an appropriate ensemble technique. Furthermore, a semi-supervised scheme is implemented to increase the quality of the classifier. When compared with established methods, GRGNN achieved state-of-the-art performance on the DREAM5 GRN inference benchmarks. GRGNN is publicly available at https://github.com/juexinwang/GRGNN.
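
As a rough illustration of the formulation described in the abstract, the following minimal Python sketch builds a noisy starting graph from Pearson's correlation coefficient, extracts the neighborhood subgraph around a TF and target gene pair, and labels it as positive or negative. It is not the GRGNN implementation: the function names, the 0.8 correlation threshold, the one-hop neighborhood, the toy expression values, and the `known_links` gold standard are all assumptions made for illustration, and the GNN classifier itself is omitted.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx > 0 and sy > 0 else 0.0


def noisy_skeleton(expr, threshold=0.8):
    """Undirected starting graph: connect genes whose |r| >= threshold.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    adj = {gene: set() for gene in expr}
    for g1, g2 in combinations(expr, 2):
        if abs(pearson(expr[g1], expr[g2])) >= threshold:
            adj[g1].add(g2)
            adj[g2].add(g1)
    return adj


def enclosing_subgraph(adj, tf, target, hops=1):
    """Nodes and edges of the skeleton within `hops` of the TF or the target."""
    nodes = {tf, target}
    frontier = {tf, target}
    for _ in range(hops):
        frontier = {n for f in frontier for n in adj[f]} - nodes
        nodes |= frontier
    edges = {frozenset((u, v)) for u in nodes for v in adj[u] if v in nodes}
    return nodes, edges


def label_pairs(adj, pairs, known_links, hops=1):
    """Label each pair's subgraph: 1 if the pair is a known link, else 0."""
    examples = []
    for tf, target in pairs:
        nodes, edges = enclosing_subgraph(adj, tf, target, hops)
        examples.append((nodes, edges, int((tf, target) in known_links)))
    return examples


# Toy expression profiles (hypothetical values, four samples per gene).
expr = {
    "TF1": [1.0, 2.0, 3.0, 4.0],
    "G1": [2.0, 4.1, 6.2, 7.9],    # rises with TF1
    "G2": [4.0, 3.1, 2.0, 0.9],    # falls as TF1 rises
    "G3": [1.0, -1.0, 1.0, -1.0],  # only weakly related to TF1
}
known_links = {("TF1", "G1")}  # hypothetical gold-standard regulatory link

skeleton = noisy_skeleton(expr)
examples = label_pairs(skeleton, [("TF1", "G1"), ("TF1", "G3")], known_links)
for nodes, edges, label in examples:
    print(sorted(nodes), len(edges), label)
```

Each resulting tuple is one graph classification example. In a full pipeline, node features derived from gene expression and graph embedding would be attached to the subgraph nodes before they are passed to a GNN classifier.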

A Learning Framework to Improve Unsupervised Gene Network Inference

A New Approach to Link Prediction in Gene Regulatory Networks

Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks

Supervised, semi-supervised and unsupervised inference of gene regulatory networks

Predicting gene regulatory links from single-cell RNA-seq data using graph neural networks

Inference of Gene Regulatory Networks Using Bayesian Nonparametric Regression and Topology Information

Unsupervised Learning from Noisy Networks with Applications to Hi-C Data.

Unsupervised Network Embedding for Graph Visualization, Clustering and Classification

Semi-supervised network inference using simulated gene expression dynamics

A Graph-Informed Modeling Framework Empowering Gene Pathway Discovery

SCGRNs: Novel supervised inference of single-cell gene regulatory networks of complex diseases

A hybrid deep learning framework for gene regulatory network inference from single-cell transcriptomic data

A modeling framework for detecting and leveraging node-level information in Bayesian network inference

Generative Graph Neural Networks for Link Prediction

Inferring gene regulatory networks from single-cell transcriptomics based on graph embedding

Unsupervised Learning via Network-Aware Embeddings

iLSGRN: inference of large-scale gene regulatory networks based on multi-model fusion

A Graph Informed Framework Empowering Gene Pathway Discovery and Gene Expression-Assisted Disease Classification

Heuristic Learning with Graph Neural Networks: A Unified Framework for Link Prediction

Semi-Supervised Prediction of Gene Regulatory Networks Using Machine Learning Algorithms

Node similarity-based graph convolution for link prediction in biological networks