Utilising energy function and variational inference training for learning a graph neural network architecture

Gayathri Girish, Deepak Mishra, Subrahamanian K. S. Moosath
DOI: https://doi.org/10.1007/s10994-024-06513-2
IF: 5.414
2024-01-25
Machine Learning
Abstract: In recent times, deep learning has advanced significantly, improving performance across machine learning tasks ranging from image and video processing to natural language understanding and speech recognition. However, conventional deep learning models such as convolutional neural networks (CNNs) are limited in processing real-world data with arbitrary shapes. Graphs, which are instrumental data structures, offer a way to model such complex non-Euclidean data. Several methods, such as statistical relational learning (SRL) and graph neural networks (GNNs), have made groundbreaking contributions to graph analysis. While GNNs build graph representations via feature aggregation, SRL methods tend to learn inter-dependencies using a combination of probabilistic modelling and logical reasoning. However, these methods suffer from critical limitations in computational efficiency and stability. This paper presents a novel approach in which the SRL and GNN aspects of graph learning are integrated to create a variational distribution called the Potts-Coulomb variational model (PCVM). By utilizing energy functions, the method effectively captures and leverages the intricate relationships among labels and features within the graphs. The model demonstrates significantly better results than the baseline models and can serve as a benchmark for further research. It can be used for multiple applications, such as node classification and link prediction. It also offers high flexibility in training, as the basic framework can be readily modified according to user requirements.
computer science, artificial intelligence
What problem does this paper attempt to address?