| Literature DB >> 32275716 |
Alon Bartal1, Nava Pliskin2, Oren Tsur3.
Abstract
Contagion in online social networks (OSN) occurs when users are exposed to information disseminated by other users. Studies of contagion are largely devoted to the spread of viral information and to local neighbor-to-neighbor contagion. However, many contagion events can be non-viral in the sense of being unpopular with low reach size, or global in the sense of being exposed to non-adjacent neighbors. This study aims to investigate the differences between local and global contagion and the different contagion patterns of viral vs. non-viral information. We analyzed three datasets and found significant differences between the temporal spreading patterns of local contagion compared to global contagion. Based on our analysis, we can successfully predict whether a user will be infected by either a local or a global contagion. We achieve an F1-score of 0.87 for non-viral information and an F1-score of 0.84 for viral information. We propose a novel method for early detection of the viral potential of an information nugget and investigate the spreading of viral and non-viral information. In addition, we analyze both viral and non-viral contagion of a topic. Differentiating between local versus global contagion, as well as between viral versus non-viral information, provides a novel perspective and better understanding of information diffusion in OSNs.Entities:
Year: 2020 PMID: 32275716 PMCID: PMC7147744 DOI: 10.1371/journal.pone.0230811
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
A summary of variables definitions.
| Parameter | Particular (original) message | Topic |
|---|---|---|
| Contagion | Users are infected by a single message | Users are infected by a topic ( |
| Social network | ||
| Activity network | ||
| Local contagion of User | ||
| Global contagion of User | ||
| Viral event | ∑ | ∑ |
| Non-viral event | 10 ≤ ∑ | 10 ≤ ∑ |
| Analyzed dataset | DS1, DS2 | DS3 |
| Distance on | Distance on | |
| Distance on | Distance on |
t—time of infection of node v; v—infecting node (who adopted the information); v0—the user who originated the information.
Fig 1An activity network with a social network.
Distances are measured on the social network.
A summary of the analyzed datasets.
| Dataset | Information type | # Contagion events | | | | | | | | | Global | Local |
|---|---|---|---|---|---|---|---|---|
| DS1 | Non-viral | 1,950 | 4.1k | 26.6k | 0.85M | 3.21M | 12.14% | 87.86% |
| DS1 | Viral | 127 | 42.2k | 132.1k | 10.3M | 53.3M | 21.75% | 78.25% |
| DS2 | Non-viral | 3,156 | 25.6k | 172.2k | 11.04M | 141.4M | 18.29% | 81.71% |
| DS3 | Non-viral | 3,359 | 67.1k | 90.2k | 0.256M | 7.7M | 12.8% | 87.2% |
| DS3 | Viral | 556 | 139.3k | 299.3k | 0.256M | 7.7M | 16.9% | 83.1% |
k—thousands; M—millions; Global—% global contagions; Local—% local contagions.
Fig 3Distributions of distances, measured on the social network (G) for each of the three analyzed datasets, DS1 to DS3 (Table 2).
Contagion spread: The larger the distance the less likely a user will retweet.
| Non-viral | Viral | |||||
|---|---|---|---|---|---|---|
| Distance | DS1 | DS2 | DS3 | DS1 | DS2 | DS3 |
| – | ||||||
| – | ||||||
| Figure | – | |||||
d and d—are defined in Table 1.
Fig 2An activity network with a social network illustrating global contagion.
Logistic regression results.
| Term | Non-viral | Viral |
|---|---|---|
| -3.45 | -3.30 | |
| Δ | -9.48 | 12.83 |
| -38.60 | -3.31 | |
| 6.38 | 5.09 | |
| 0.87 | 0.84 |
** p < 2e × 1016,
*p < 0.01
Fig 4ECDFs of inter-retweet times.
The time axis scale is transformed into a log scale.
Time when 80% of inter-retweet times of local and global contagion are reached, considering non-viral information.
| Dataset | Global contagion (80%) | Local contagion (80%) |
|---|---|---|
| 3.6 hours | 1.8 hours | |
| 0.44 hours | 0.91 hours |
Kolmogorov-Smirnov results.
| DS | # | Pairs of ECDFs tested by a KS-test | D-statistic | Figure |
|---|---|---|---|---|
| DS1 | 1 | Non-viral local, Viral local | 0.48 | |
| 2 | Non-viral global, Viral global | 0.55 | ||
| 3 | Non-viral global, Non-viral local | 0.16 | ||
| 4 | Viral global, Viral local | 0.07 | ||
| BIT DS1 | 1 | Non-viral local, BIT local | 0.58 | |
| 2 | Non-viral global, BIT global | 0.51 | ||
| 3 | Non-viral global, Non-viral local | 0.16 | ||
| 4 | BIT global, BIT local | 0.13 | ||
| DS3 | 1 | Non-viral local, Viral local | 0.37 | |
| 2 | Non-viral global, Viral global | 0.47 | ||
| 3 | Non-viral global, Non-viral local | 0.12 | ||
| 4 | Viral global, Viral local | 0.09 |
*P < 2.2 × 10-16;
**P < 2.2 × 10-22
Fig 5Retweet-count distribution of BIT and non-viral tweets.