| Literature DB >> 34977334 |
Veronika Batzdorfer1, Holger Steinmetz2, Marco Biella3, Meysam Alizadeh4.
Abstract
The COVID-19 pandemic resulted in an upsurge in the spread of diverse conspiracy theories (CTs) with real-life impact. However, the dynamics of user engagement remain under-researched. In the present study, we leverage Twitter data across 11 months in 2020 from the timelines of 109 CT posters and a comparison group (non-CT group) of equal size. Within this approach, we used word embeddings to distinguish non-CT content from CT-related content as well as analysed which element of CT content emerged in the pandemic. Subsequently, we applied time series analyses on the aggregate and individual level to investigate whether there is a difference between CT posters and non-CT posters in non-CT tweets as well as the temporal dynamics of CT tweets. In this regard, we provide a description of the aggregate and individual series, conducted a STL decomposition in trends, seasons, and errors, as well as an autocorrelation analysis, and applied generalised additive mixed models to analyse nonlinear trends and their differences across users. The narrative motifs, characterised by word embeddings, address pandemic-specific motifs alongside broader motifs and can be related to several psychological needs (epistemic, existential, or social). Overall, the comparison of the CT group and non-CT group showed a substantially higher level of overall COVID-19-related tweets in the non-CT group and higher level of random fluctuations. Focussing on conspiracy tweets, we found a slight positive trend but, more importantly, an increase in users in 2020. Moreover, the aggregate series of CT content revealed two breaks in 2020 and a significant albeit weak positive trend since June. On the individual level, the series showed strong differences in temporal dynamics and a high degree of randomness and day-specific sensitivity. The results stress the importance of Twitter as a means of communication during the pandemic and illustrate that these beliefs travel very fast and are quickly endorsed. Supplementary Information: The online version contains supplementary material available at 10.1007/s41060-021-00298-6.Entities:
Keywords: COVID-19; Conspiracy beliefs; Time series analysis; Twitter structural break analysis; Word embedding
Year: 2021 PMID: 34977334 PMCID: PMC8703214 DOI: 10.1007/s41060-021-00298-6
Source DB: PubMed Journal: Int J Data Sci Anal
Levels of research questions
| CT group | Non-CT group | |
|---|---|---|
| Aggregate level (averaged tweets) | ||
| Individual level | ||
CT Conspiracy theory, RQ Research question
Fig. 1Framework for constructing Global Vectors for Word Representation (GloVe) models and measuring similarity
Fig. 2Illustration of the Concept Mover’s Distance principle (with and representing fictitious tweets and a “pseudo”-document comprising only one term) (see also Kusner et al. [63])
Descriptive statistics of tweets by account for the CT group and non-CT group
| Tweet characteristics per user | CT group | Non-CT group | ||||
|---|---|---|---|---|---|---|
| Min | Max | Min | Max | |||
| Number of daily tweets | 11.70 (11.60) | 0.27 | 70.00 | 2.56 (1.79) | 0.05 | 9.13 |
| Number of daily corona-related tweets over the year | 0.59 (0.69) | 0.01 | 3.62 | 0.46 (0.57) | 0.01 | 3.85 |
| Proportion of corona-related tweets over the year | .04 (.02) | 0.003 | 0.11 | .12 (.12) | 0.005 | 0.59 |
| Number of conspiracy-related tweets over the year | 5.33 (6.07) | 0.11 | 44.9 | – | – | – |
| Proportion of conspiracy-related tweets over the year | 0.35 (0.13) | 0.06 | 0.65 | – | – | – |
Fig. 3Mean proportion of COVID-19-related tweets by the CT group and the non-CT group
Results of the GAM investigating differences between the non-CT posters and CT posters in overall coronavirus-related tweets
| Separate smooth model | ||
|---|---|---|
| Intercept | − 1.61*** | < .001 |
| Group: CT group | − 1.53*** | < .001 |
EDF = Effective degrees of freedom (indicates the amount of wiggliness of a curve); EDF = 1 indicates a straight line; ***p < .001
Fig. 4Distribution of CT posters (upper panel) and mean proportion of CT tweets (bottom panel)
Fig. 5Structural break analysis of CT tweets for the CT group via the CUSUM and F-test. Grey areas indicate confidence intervals for two structural breaks on 10 March 2020 and 8 June 2020 (dashed lines)
Results of the generalised additive mixed-effects model addressing inter-individual differences in time trends
| Random intercept model | Random slope model | Random smooth model | ||||
|---|---|---|---|---|---|---|
| EDF | EDF | EDF | ||||
| Time trend | 7.77*** | < .001 | 7.77*** | < .001 | 7.50*** | < .001 |
| Weekday | 2.36 | .083 | 2.34 | .092 | 2.34 | .095 |
| Random intercept | 102.80*** | < .001 | 77.85*** | < .001 | ||
| Random slope | 71.42*** | < .001 | ||||
| Random smooth | 266.14*** | < .001 | ||||
| R square | .139 | .156 | .180 | |||
| Deviance explained | .009 | .009 | .010 | |||
| AIC | − 423,743.1 | − 424,046.6 | − 424,468.2 | |||
EDF = Effective degrees of freedom (indicates the amount of wiggliness of a curve); EDF = 1 indicates a straight line; ***p < .001; AIC = Akaike information criterion
Fig. 6Proportion of CT-related tweets and effective degrees of freedom (EDF) for a subsample of individual CT posters (at least 200 days of posting behaviour)