A Thorough Comparison Between Independent Cascade and Susceptible-Infected-Recovered Models

Panfeng Liu, Guoliang Qiu, Biaoshuai Tao, Kuan Yang
2024-08-21
Abstract: We study cascades in social networks with the independent cascade (IC) model and the Susceptible-Infected-Recovered (SIR) model. The well-studied IC model fails to capture the feature of node recovery, and the SIR model is a variant of the IC model with the node recovery feature. In the SIR model, by computing the probability that a node successfully infects another before its recovery and viewing this probability as the corresponding IC parameter, the SIR model becomes an "out-going-edge-correlated" version of the IC model: the events of the infections along different out-going edges of a node become dependent in the SIR model, whereas these events are independent in the IC model. In this paper, we thoroughly compare the two models and examine the effect of this extra dependency in the SIR model. By a carefully designed coupling argument, we show that the seeds in the IC model have a stronger influence spread than their counterparts in the SIR model, and sometimes it can be significantly stronger. Specifically, we prove that, given the same network, the same seed sets, and the parameters of the two models being set based on the above-mentioned equivalence, the expected number of infected nodes at the end of the cascade for the IC model is weakly larger than that for the SIR model, and there are instances where this dominance is significant. We also study the influence maximization problem with the SIR model. We show that the above-mentioned difference in the two models yields different seed-selection strategies, which motivates the design of influence maximization algorithms specifically for the SIR model. We design efficient approximation algorithms with theoretical guarantees by adapting the reverse-reachable-set-based algorithms, commonly used for the IC model, to the SIR model.
Social and Information Networks, Physics and Society
What problem does this paper attempt to address?
The problem this paper addresses is how the Independent Cascade (IC) model and the Susceptible-Infected-Recovered (SIR) model differ as models of spreading in social networks, and how those differences affect the influence maximization problem. Specifically:

1. **Model Comparison**:
   - The IC model assumes that once a node is activated (infected), it remains activated, and the infection events along different edges are independent of each other.
   - The SIR model introduces a "recovery" mechanism: an infected node may recover, after which it can neither be infected again nor spread the infection. Because a node keeps attempting to infect its neighbors until it recovers, all of its attempts share the same recovery time, so the infection events along its different outgoing edges become correlated.
2. **Dependence Impact**:
   - Using a coupling argument, the paper proves that, given the same network structure, the same seed set, and matched parameter settings, the expected number of infected nodes in the IC model is always greater than or equal to that in the SIR model. In other words, seeds in the IC model have a stronger spreading effect.
   - The authors also show that in some instances this gap can be significant, indicating that the additional dependence in the SIR model weakens information spreading.
3. **Influence Maximization Problem**:
   - The influence maximization problem asks for a set of initial nodes (seeds) that maximizes their influence in the social network. The paper points out that although the IC and SIR models behave similarly in some cases, their optimal seeding strategies may differ.
   - To address this, the authors design efficient approximation algorithms for the SIR model, based on the reverse-reachable-set technique and equipped with theoretical guarantees.
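The effect of the outgoing-edge correlation described above can be sketched on a tiny example. The snippet below is an illustration, not code from the paper: the four-node "diamond" graph (seed `s` with edges to `a` and `b`, which both point to `w`), the function names, and the parameter values are all assumptions chosen for this sketch. It assumes every edge has the same per-round infection probability `beta` and every node the same per-round recovery probability `gamma`, and that a node makes at least one infection attempt before it recovers.

```python
def sir_pair_moments(beta, gamma, tol=1e-15):
    """For one SIR node with two out-edges, return (p, joint).

    p     : probability that a given out-edge transmits before recovery
    joint : probability that BOTH out-edges transmit before recovery
    Both are sums over the shared recovery round t (geometric in gamma).
    """
    p, joint = 0.0, 0.0
    survive = 1.0  # (1 - gamma)^(t - 1): still infectious before round t
    miss = 1.0     # (1 - beta)^t: all t attempts on one edge failed
    while survive >= tol:
        miss *= 1.0 - beta
        weight = gamma * survive      # P(recovery happens at round t)
        hit = 1.0 - miss              # P(edge transmitted within t rounds)
        p += weight * hit
        joint += weight * hit * hit   # edges are independent given t
        survive *= 1.0 - gamma
    return p, joint


def diamond_spread(p, joint, q):
    """Expected number of infected nodes on s->a, s->b, a->w, b->w.

    p     : marginal probability that s infects a (and likewise b)
    joint : probability that s infects both a and b
    q     : probability that a (or b), once infected, infects w
    """
    # P(w infected) = 1 - E[(1 - q*X_a)(1 - q*X_b)] = 2*q*p - q^2 * joint
    p_w = 2.0 * q * p - q * q * joint
    return 1.0 + 2.0 * p + p_w  # seed + a + b + w


beta, gamma = 0.5, 0.5  # hypothetical parameter values
p, joint_sir = sir_pair_moments(beta, gamma)
ic_spread = diamond_spread(p, p * p, p)        # IC: edges independent
sir_spread = diamond_spread(p, joint_sir, p)   # SIR: edges correlated
print(f"IC spread:  {ic_spread:.4f}")
print(f"SIR spread: {sir_spread:.4f}")
```

Both models give `a` and `b` the same marginal infection probability, so the difference comes only from `w`: under SIR, `a` and `b` tend to be infected together or not at all, which makes the second path to `w` redundant more often and lowers the expected spread relative to IC.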
### Formula Summary

- **Infection Probability in the SIR Model**:

\[ p_{u,v}=\sum_{t = 1}^{\infty}\gamma_u(1-\gamma_u)^{t - 1}\left(1-(1-\beta_{u,v})^t\right) \]

where $\gamma_u$ is the recovery rate of node $u$, and $\beta_{u,v}$ is the probability that node $u$ successfully infects node $v$ in a single attempt. The sum runs over the round $t$ in which $u$ recovers, so $p_{u,v}$ is the probability that $u$ infects $v$ before recovering; it serves as the corresponding IC parameter.

### Main Contributions

- **Comparison of Spreading Effects**: Rigorous proofs show that, under matched parameters, the expected spread in the IC model is at least as large as in the SIR model, and in some instances significantly larger.
- **Algorithm Design**: The paper proposes efficient approximation algorithms with theoretical guarantees for influence maximization under the SIR model, so that a near-optimal seed set can be found quickly even in large-scale networks.

### Conclusion

This research reveals the essential differences between the IC and SIR models of information spreading and provides new solutions for the influence maximization problem in practical applications. It is relevant to understanding spreading mechanisms in social networks and to applications such as optimizing marketing strategies.
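As a numerical check of the infection probability in the Formula Summary, the sketch below evaluates the series by truncation and compares it with a closed form. The closed form $\beta/(\beta+\gamma-\beta\gamma)$ is not stated in the text above; it is obtained here by summing the two geometric series in the formula, and should be read as a derived convenience under the same assumptions. The function names and the sample parameter values are hypothetical.

```python
def sir_to_ic_probability(beta, gamma, tol=1e-12):
    """Truncated evaluation of
    p = sum_{t>=1} gamma * (1-gamma)^(t-1) * (1 - (1-beta)^t).
    """
    if not (0.0 < gamma <= 1.0 and 0.0 <= beta <= 1.0):
        raise ValueError("need 0 < gamma <= 1 and 0 <= beta <= 1")
    total = 0.0
    survive = 1.0  # (1 - gamma)^(t - 1)
    miss = 1.0     # (1 - beta)^t
    # The omitted tail is at most `survive`, so stop once it is below tol.
    while survive >= tol:
        miss *= 1.0 - beta
        total += gamma * survive * (1.0 - miss)
        survive *= 1.0 - gamma
    return total


def sir_to_ic_closed_form(beta, gamma):
    """Closed form from summing the geometric series (derived here)."""
    return beta / (beta + gamma - beta * gamma)


for beta, gamma in [(0.5, 0.5), (0.1, 0.3), (0.2, 1.0)]:
    series = sir_to_ic_probability(beta, gamma)
    closed = sir_to_ic_closed_form(beta, gamma)
    print(f"beta={beta}, gamma={gamma}: series={series:.6f}, closed={closed:.6f}")
```

With `gamma = 1.0` the node recovers after a single attempt, so the probability reduces to `beta`, which is a quick sanity check on both functions.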