Abstract:Signed graphs are complex systems that represent trust relationships or preferences in various domains. Learning node representations in such graphs is crucial for many mining tasks. Although real-world signed relationships can be influenced by multiple latent factors, most existing methods often oversimplify the modeling of signed relationships by relying on social theories and treating them as simplistic factors. This limits their expressiveness and their ability to capture the diverse factors that shape these relationships. In this paper, we propose DINES, a novel method for learning disentangled node representations in signed directed graphs without social assumptions. We adopt a disentangled framework that separates each embedding into distinct factors, allowing for capturing multiple latent factors. We also explore lightweight graph convolutions that focus solely on sign and direction, without depending on social theories. Additionally, we propose a decoder that effectively classifies an edge's sign by considering correlations between the factors. To further enhance disentanglement, we jointly train a self-supervised factor discriminator with our encoder and decoder. Throughout extensive experiments on real-world signed directed graphs, we show that DINES effectively learns disentangled node representations, and significantly outperforms its competitors in the sign prediction task.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to capture multiple latent factors when learning node representations in directed signed graphs without relying on social assumptions (such as balance theory or status theory). Most of the existing methods tend to over - simplify the modeling of signed relationships, relying only on some simple social theories, which limits their expressiveness and ability to capture complex relationships.
### Problem Background
In many practical scenarios, signed graphs are used to represent complex systems such as trust relationships and preferences. For example, in social networks, edges can represent trust or distrust relationships between users. Learning node representations in these graphs is crucial for various mining tasks, such as link prediction, node classification, and community discovery. However, signed relationships in the real world are usually influenced by multiple latent factors, while existing methods often consider only a few simple factors, such as amity/hostility or status, which leads to insufficient model expressiveness.
### Paper Objectives
To solve the above problems, this paper proposes DINES (Disentangled Neural Networks for Signed Directed Graphs), a new graph neural network method, aiming to learn node representations in directed signed graphs without relying on any social assumptions. Specifically, DINES achieves this goal in the following ways:
1. **Disentangled Representation Framework**: Decompose the embedding of each node into multiple independent latent factors, thereby capturing multiple latent factors.
2. **Light - weight Graph Convolution**: Design a light - weight graph convolution layer by only considering the direction and sign of edges, without relying on social theories.
3. **Decoder Design**: Propose a new decoding strategy to construct edge features by considering the pairwise correlations of disentangled factors between nodes and predict the sign of edges.
4. **Self - supervised Factor Discriminator**: To further enhance the disentangling effect, a self - supervised factor discriminator is jointly trained.
### Advantages of the Solution
- **Accuracy**: DINES performs excellently in the link - sign prediction task. Compared with the most accurate competing method, AUC and Macro - F1 are increased by 3.1% and 6.5% respectively.
- **Scalability**: Its training and inference times have a linear relationship with the number of edges in real and synthetic signed graphs, showing good scalability.
- **Trade - off between Speed and Accuracy**: It shows a better trade - off between training time and accuracy, outperforming existing signed graph neural network methods.
### Formula Presentation
To understand the working principle of DINES more clearly, the following are some of the key formulas:
1. **Initial Disentangling Operation**:
\[
f_v^{(0), k}=\text{normalize}\left(\sigma\left(\text{FC}_k^{(0)}(x_v)\right)\right)=\frac{\tanh\left(W_k^{\top}x_v + b_k\right)}{\|\tanh\left(W_k^{\top}x_v + b_k\right)\|_2}
\]
where \(f_v^{(0), k}\) is the \(k\)-th latent factor of node \(v\), \(\sigma\) is a non - linear activation function (such as \(\tanh\)), and \(W_k\) and \(b_k\) are the weight and bias parameters respectively.
2. **Disentangled Signed Graph Convolution**:
\[
f_u^{(l), k}=\text{dsg - conv}_k^{(l)}\left(f_v^{(l - 1), k}:v\in N_u\cup\{u\}\right)
\]
\[
m_u^{(l), k}=\sum_{\delta\in D}\text{aggregate}_k\left(f_v^{(l - 1), k}:v\in N_\delta^u\right)
\]
\[
f_u^{(l), k}=\sigma\left(\text