Abstract:It is widely acknowledged that the initial spreaders play an important role for the wide spreading of information in complex networks. Thus, a variety of centrality-based methods have been proposed to identify the most influential spreaders. However, most of the existing studies have overlooked the fact that in real social networks it is more costly and difficult to convince influential individuals to act as initial spreaders, resulting in a high risk in maximizing the spreading. In this paper, we address this problem on the basis of the assumption that large-degree nodes are activated with a higher risk than small-degree nodes. We aim to identify the effective initial spreaders to maximize spreading when considering both the activation risk and the outbreak size of initial spreaders. On random networks, the analytical analysis reveals that the degree of optimal initial spreaders does not correspond to the largest degree of nodes in the network but rather be determined by infection probability and difference of activation risk among nodes with different degree. Here, we propose a risk-aware metric to identify the effective spreaders on real networks. The numerical simulation shows that the risk-aware metric outperforms the existing benchmark centralities in maximizing the effective spreading.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to maximize the spread effect in complex networks while considering the risk of node activation. Specifically, the traditional influence maximization problem usually assumes that the activation probability of initial spreaders is independent of their individual influence. However, in real - world applications, highly influential individuals have a higher activation cost and are more difficult to activate as initial spreaders, so there is a higher risk. For example, in marketing, hiring highly influential individuals to promote products requires higher payments, and these individuals may be unwilling to participate due to time or liability constraints.
The paper proposes that when considering both the spread scale and the node activation risk, simply choosing initial spreaders based on existing centrality indicators may not maximize the spread effect. Therefore, the authors propose a risk - aware metric (RA) aimed at identifying effective initial spreaders to maximize the spread effect while considering the activation risk. This metric achieves this goal by rewarding nodes connected to highly influential neighbors and penalizing highly influential nodes.
### Main Contributions
1. **Introducing Activation Risk**: For the first time, the paper introduces the concept of node activation risk in the spread maximization problem, believing that highly influential nodes have a higher activation risk.
2. **Proposing a Risk - Aware Metric**: A new risk - aware metric (RA) is designed, which comprehensively considers the spread ability and activation risk of nodes.
3. **Theoretical Analysis and Experimental Verification**: The effectiveness of the new metric is verified through theoretical analysis on random networks and numerical simulations on real - world networks.
### Formula Explanation
- **Activation Probability Formula**:
\[
p_i=\frac{e^{-\lambda k_i}}{\langle k\rangle}
\]
where \(p_i\) is the probability that node \(i\) is activated, \(k_i\) is the degree of node \(i\), \(\langle k\rangle\) is the average degree of the network, and \(\lambda\) is a risk parameter representing the difference in activation probabilities between nodes with different degrees.
- **Effective Spread Coverage**:
\[
\tilde{s}_i = p_i\cdot s_i
\]
where \(s_i\) is the coverage range starting from node \(i\), that is, the proportion of infected nodes to the total number of nodes, and \(\tilde{s}_i\) is the effective spread coverage after considering the activation risk.
- **Degree of the Optimal Initial Spreader**:
\[
k^*=\frac{\langle k\rangle}{\lambda}-1-\frac{\beta}{\beta_c}\cdot\frac{\langle k^2\rangle-\langle k\rangle}{\langle k\rangle}
\]
where \(k^*\) is the degree of the optimal initial spreader, \(\beta\) is the infection rate, \(\beta_c\) is the critical infection rate, and \(\langle k^2\rangle\) is the second - order moment of the degree.
### Experimental Methods
- **SIR Model**: Use the SIR model to simulate the spread process of information in social networks.
- **Benchmark Method**: Use the ratio of existing centrality indicators (such as degree centrality, core number, closeness centrality, betweenness centrality, eigenvector centrality, etc.) to the degree as the benchmark method for comparison.
### Conclusion
Through theoretical analysis and experiments, the paper proves that the risk - aware metric (RA) can more effectively identify initial spreaders when considering the node activation risk, thereby maximizing the spread effect. This research provides new ideas and tools for spread optimization in practical applications.