Abstract:Community detection is an essential tool for unsupervised data exploration and revealing the organisational structure of networked systems. With a long history in network science, community detection typically relies on objective functions, optimised with custom-tailored search algorithms, but often without leveraging recent advances in deep learning. Recently, first works have started incorporating such objectives into loss functions for neural graph clustering and pooling. We consider the map equation, a popular information-theoretic objective function for unsupervised community detection, and express it in differentiable tensor form for optimisation through gradient descent. Our formulation turns the map equation compatible with any neural network architecture, enables end-to-end learning, incorporates node features, and chooses the optimal number of clusters automatically, all without requiring explicit regularisation. Applied to unsupervised graph clustering tasks, we achieve competitive performance against state-of-the-art neural graph clustering baselines in synthetic and real-world datasets.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: how to use Graph Neural Networks (GNNs) to optimize the "map equation" in community detection, so as to achieve more efficient and accurate community discovery in unsupervised graph clustering tasks. Specifically, the paper aims to: 1. **Combine information - theoretic methods with deep learning**: By transforming traditional information - theoretic - based community detection methods (such as the map equation) into differentiable loss functions and combining GNNs for end - to - end learning, in order to achieve automated community detection. 2. **Improve the effect of community detection**: By introducing node features and soft clustering assignment matrices, the model can better handle complex network structures in the real world, and at the same time automatically select the optimal number of communities without the need for explicit regularization. 3. **Avoid over - fitting problems**: By the Minimum Description Length (MDL) principle, ensure that the model does not over - partition the network, thereby avoiding over - fitting problems. This is different from traditional methods, which usually require explicit regularization or cross - validation to prevent over - fitting. 4. **Improve the flexibility and extensibility of the model**: By using gradient descent to optimize the map equation, this method can be compatible with any neural network architecture and can perform efficient parallel computing on GPU clusters. ### Specific contributions of the paper - **Propose Neuromap**: A map - equation - optimization algorithm based on deep learning, which can automatically select the optimal number of communities in unsupervised community detection and can handle overlapping communities. - **Experimental verification**: Extensive experiments were carried out on hundreds of synthetic datasets and ten real - world datasets, and the results show that Neuromap outperforms existing GNN baseline methods in most cases. - **Reveal the limitations of existing methods**: By setting a higher maximum number of communities, the authors found that existing methods tend to over - fit and report results far exceeding the actual number of communities, especially when there are no appropriate limitations. In summary, the main goal of this paper is to propose a new, more flexible and efficient community - detection method by combining the advantages of information theory and deep learning to meet the challenges in complex network analysis.

The Map Equation Goes Neural: Mapping Network Flows with Graph Neural Networks

Discovering the Representation Bottleneck of Graph Neural Networks from Multi-order Interactions

Map Equation Centrality: Community-aware Centrality based on the Map Equation

Community Detection with the Map Equation and Infomap: Theory and Applications

iMap: Incremental Node Mapping between Large Graphs Using GNN

Graph Neural Modeling of Network Flows

Mapping biased higher-order walks reveals overlapping communities

Learning Deep Graph Representations via Convolutional Neural Networks

Re-Think and Re-Design Graph Neural Networks in Spaces of Continuous Graph Diffusion Functionals

Graph neural network inspired algorithm for unsupervised network community detection

Scalable and Efficient Flow-Based Community Detection for Large-Scale Graph Analysis

How Graph Neural Networks Learn: Lessons from Training Dynamics

Unboxing the graph: Neural Relational Inference for Mobility Prediction

EdgeMap: An Optimized Mapping Toolchain for Spiking Neural Network in Edge Computing

Unsupervised Network Embedding for Graph Visualization, Clustering and Classification

A Unified View on Neural Message Passing with Opinion Dynamics for Social Networks

Learning Convolutional Neural Networks for Data-Flow Graph Mapping on Spatial Programmable Architectures (abstract Only)

Erdos Goes Neural: an Unsupervised Learning Framework for Combinatorial Optimization on Graphs

Neural Graph Machines: Learning Neural Networks Using Graphs

A network analysis-based framework to understand the representation dynamics of graph neural networks

Neural Monge Map estimation and its applications