| Literature DB >> 33286498 |
Liang Zhang1, Yong Quan1, Bin Zhou1, Yan Jia1, Liqun Gao1.
Abstract
The recent development of the mobile Internet and the rise of social media have significantly enriched the way people access information. Accurate modeling of the probability of information propagation between users is essential for studying information dissemination issues in social networks. As the dissemination of information is inseparable from the interactions between users, the probability of propagation can be characterized by such interactions. In general, there are differences in the dissemination modes of information that carry different topics in a real social network. Using these factors, we propose a method (TMIVM) to measure the mutual influence between users at the topic level. The method associates two vectorization parameters for each user-an influence vector and a susceptibility vector-where the dimensions of the vector represent different topic categories. The magnitude of the mutual influence between users on different topics can be obtained by the product of the corresponding elements of the vectors. Specifically, in this article, we fit a social network historical information cascade data through Survival Analysis to learn the parameters of the influence and susceptibility vectors. The experimental results on a synthetic data set and a real Microblog data set show that this method better measures the propagation probability and information cascade predictions compared to other methods.Entities:
Keywords: Survival Analysis; TMIVM (Topic Mutual Influence Vector Model); information dissemination; mutual influence; propagation probability
Year: 2020 PMID: 33286498 PMCID: PMC7517265 DOI: 10.3390/e22070725
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Symbol definitions and explanations.
| Symbol | Explanation |
|---|---|
|
| Social Networks |
|
| A set of users |
|
| A set of following relationships between users |
|
| Number of topics |
|
| An original blog message |
|
| Topic distribution vector of information |
|
| Cascade recording of information |
|
| The parent node of |
|
| Influence vector |
|
| Susceptibility vector |
|
| Mutual influence of user |
|
| Probability of information m being propagated to |
Figure 1Schematic diagram of the information cascade propagation.
Figure 2Distribution statistics of the time interval between the original blog posts being forwarded. (a) Distribution of forwarding time intervals within one week; (b) Distribution of forwarding time intervals within 2 h.
Figure 3Evaluation of user influence and susceptibility measured in different data sets. (a) Estimation of influence in the Forest Fire Model; (b) Estimation of susceptibility in the Forest Fire Model; (c) Estimation of influence in the Kronecker Graph; (d) Estimation of susceptibility in the Kronecker Graph.
Figure 4Algorithm prediction ROC curve of user forwarding information. (a) The ROC curve of the algorithm in Forest Fire Model; (b) The Roc curve of the algorithm in Kronecker graph.
Predicted AUC values in the forest fire model.
| Cascade Scale | Comparison Method | |||
|---|---|---|---|---|
| ICEM | NETRATE | LIS | TMIVM | |
| 1000 | 0.539 ± 0.028 | 0.561 ± 0.024 | 0.612 ± 0.021 | 0.685 ± 0.015 |
| 2000 | 0.558 ± 0.033 | 0.597 ± 0.037 | 0.651 ± 0.017 | 0.734 ± 0.014 |
| 5000 | 0.603 ± 0.031 | 0.647 ± 0.018 | 0.715 ± 0.015 | 0.788 ± 0.008 |
Predicted AUC values in the Kronecker graph model.
| Cascade Scale | Comparison Method | |||
|---|---|---|---|---|
| ICEM | NETRATE | LIS | TMIVM | |
| 1000 | 0.525 ± 0.045 | 0.554 ± 0.031 | 0.568 ± 0.027 | 0.653 ± 0.019 |
| 2000 | 0.547 ± 0.041 | 0.589 ± 0.026 | 0.617 ± 0.028 | 0.699 ± 0.012 |
| 5000 | 0.596 ± 0.039 | 0.638 ± 0.021 | 0.679 ± 0.022 | 0.741 ± 0.010 |
Figure 5Cascading scale prediction of different types of topic information in the forest fire model.
Figure 6Operational efficiency of the algorithm in the Kronecker graph model.
Description of the microblog data set.
| Attribute Description | Statistics |
|---|---|
| User | 10,794 |
| Number of following relationships | 341,652 |
| Original microblogs | 57,347 |
| Retweets | 2,892,451 |
Figure 7Information forwarding cascade prediction: ROC curve.
Figure 8Prediction of an intermediate scale real data set.
Figure 9Efficiency of the algorithm in the real data set.