| Literature DB >> 33024152 |
Matteo Cinelli1,2, Walter Quattrociocchi3,4,5, Alessandro Galeazzi6, Carlo Michele Valensise7, Emanuele Brugnoli1, Ana Lucia Schmidt2, Paola Zola8, Fabiana Zollo1,2,9, Antonio Scala1,10.
Abstract
We address the diffusion of information about the COVID-19 with a massive data analysis on Twitter, Instagram, YouTube, Reddit and Gab. We analyze engagement and interest in the COVID-19 topic and provide a differential assessment on the evolution of the discourse on a global scale for each platform and their users. We fit information spreading with epidemic models characterizing the basic reproduction number [Formula: see text] for each social media platform. Moreover, we identify information spreading from questionable sources, finding different volumes of misinformation in each platform. However, information from both reliable and questionable sources do not present different spreading patterns. Finally, we provide platform-dependent numerical estimates of rumors' amplification.Entities:
Mesh:
Year: 2020 PMID: 33024152 PMCID: PMC7538912 DOI: 10.1038/s41598-020-73510-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Upper panel: activity (likes, comments, reposts, etc.) distribution for each social media. Middle panel: most discussed topics about COVID-19 on each social media. Lower panel: cumulative number of content (posts, tweets, videos, etc.) produced from the 1st of January to the 14th of February. Due to the Twitter API limitations in gathering past data, the first data point for Twitter is dated January 27th.
Figure 2Growth of the number of authors versus time. Time is expressed in number of days since 1st January 2020 (day 1). Shaded areas represents [5%, 95%] estimates of the models obtained via bootstrapping least square estimates of the EXP model (upper panels) and of the SIR model (lower panels). For details the SIR and the EXP model, see SI.
[5%, 95%] interval of confidence as estimated from bootstrapping the least square fits parameter of the EXP and of the SIR model.
| Gab | YouTube | ||||
|---|---|---|---|---|---|
| [1.42, 1.52] | [1.44, 1.51] | [1.56, 1.70] | [2.02, 2.64] | [1.65, 2.06] | |
| [2.2, 2.5] | [2.4, 2.8] | [3.2, 3.5] | [4.0, 5.1] |
Notice that, due to the steepness of the growth of the number of new authors in Instagram, assumes unrealistic values for the SIR model.
Figure 3Upper panels: plot of the cumulative number of posts referring to questionable sources versus the cumulative number of posts referring to reliable sources. Lower panel: plot of the cumulative number of engagements relatives to questionable sources versus the cumulative number of engagements relative to reliable sources. Notice that a linear behavior indicates that the time evolution of questionable posts/engagements is just a re-scaled version of the time evolution of reliable posts/engagements. Each plot indicates the regression coefficients , representing the ratio among the volumes of questionable and reliable posts () and engagements (). In more popular social media, the number of questionable posts represents a small fraction of the reliable ones; same thing happens in Reddit. Among less popular social media, a peculiar effect is observed in Gab: while the volume of questionable posts is just the of the volume of reliable ones, the volume of engagements for questionable posts is times bigger than the volume for reliable ones. Further details concerning the regression coefficients are reported in Methods.
The average engagement of a post is the number of reactions expected for a post and is a measure of how much a post is amplified in each social media platform.
| Gab | 5.6 | 1.4 | 3.9 |
| 22.7 | 40.1 | 0.55 | |
| 15.1 | 15.6 | 0.97 | |
| YouTube | 0.35 |
The average engagement (for unreliable post) and (for reliable post) vary from platform to platform, and are the largest in Twitter and the lowest in Gab. The coefficient of relative amplification measures whether a social media amplifies more unreliable () or reliable () posts. Among more popular social media platforms, we notice that Twitter is the most neutral ( i.e. ), while YouTube amplifies unreliable sources less (). Among less popular social media platforms, Reddit reduces the impact of unreliable sources () while Gab strongly amplifies them ().
Data breakdown of the number of posts, comments and users for all platforms.
| Posts | Comments | Users | Period | |
|---|---|---|---|---|
| Gab | 6,252 | 4,364 | 2,629 | 01/01–14/02 |
| 10,084 | 300,751 | 89,456 | 01/01–14/02 | |
| YouTube | 111,709 | 7,051,595 | 3,199,525 | 01/01–14/02 |
| 26,576 | 109,011 | 52,339 | 01/01–14/02 | |
| 1,187,482 | – | 390,866 | 27/01–14/02 | |
| Total | 1,342,103 | 7,465,721 | 3,734,815 |
Number of posts containing a URL, matching ability and classification for each of the five platforms.
| Gab | YouTube | ||||
|---|---|---|---|---|---|
| Posts containing a URL | 3,778 | 10,084 | 351,786 | 1,328 | 356,448 |
| Matched | 0.47 | 0.55 | 0.035 | 0.09 | 0.27 |
| Questionable | 0.38 | 0.045 | 0.064 | 0.05 | 0.10 |
| Reliable | 0.62 | 0.955 | 0.936 | 0.95 | 0.90 |
Fraction of URLs pointing to social media.
| Gab | YouTube | |||||
|---|---|---|---|---|---|---|
| Gab | 0.003 | 0.002 | 0.001 | 0.002 | 0.138 | ∼ 0 |
| 0.043 | 0.006 | 0.009 | 0.001 | ∼ 0 | 0 | |
| YouTube | 0 | ∼ 0 | 0.292 | ∼ 0 | 0.088 | 0.081 |
| 0 | 0 | 0.003 | 0 | 0.001 | 0.001 | |
| 0.059 | 0.001 | 0.257 | 0.003 | ∼ 0 | ∼ 0 |
Table should be read as entries in each row link to entries in each column. For example, Gab links to Reddit 0.003.
Results of text cleaning and analysis for all the corpora.
| Cleaned contents | Vocabulary size | Topics | Contents with | |
|---|---|---|---|---|
| 21,189 posts | 15,324 | 17 | 4,467 | |
| 638,214 posts | 22,587 | 21 | 369,131 | |
| Gab | 5,853 posts | 3,024 | 19 | 2,986 |
| 10,084 posts | 1,968 | 34 | 6,686 | |
| YouTube | 815,563 comments | 35,381 | 30 | 679,261 |
Coefficients and of the linear regressions displayed in Fig. 3.
| Dataset | Type | Intercept | Coefficient ( | |
|---|---|---|---|---|
| Gab | Posts | − 22.321 | 0.695 | 0.996 |
| Posts | − 4.111 | 0.047 | 0.997 | |
| Youtube | Posts | 4.529 | 0.073 | 0.998 |
| Posts | − 151.44 | 0.110 | 0.998 | |
| Gab | Reactions | 74.577 | 2.721 | 0.981 |
| Reactions | − 70.677 | 0.026 | 0.990 | |
| Youtube | Reactions | − 8854.33 | 0.025 | 0.986 |
| Reactions | − 2136.978 | 0.107 | 0.987 |