| Literature DB >> 32838229 |
Julie Jiang1,2, Emily Chen1,2, Shen Yan1,2, Kristina Lerman1,2, Emilio Ferrara1,2,3.
Abstract
Since the outbreak in China in late 2019, the novel coronavirus (COVID-19) has spread around the world and has come to dominate online conversations. By linking 2.3 million Twitter users to locations within the United States, we study in aggregate how political characteristics of the locations affect the evolution of online discussions about COVID-19. We show that COVID-19 chatter in the United States is largely shaped by political polarization. Partisanship correlates with sentiment toward government measures and the tendency to share health and prevention messaging. Cross-ideological interactions are modulated by user segregation and polarized network structure. We also observe a correlation between user engagement with topics related to public health and the varying impact of the disease outbreak in different U.S. states. These findings may help inform policies both online and offline. Decision-makers may calibrate their use of online platforms to measure the effectiveness of public health campaigns, and to monitor the reception of national and state-level policies, by tracking in real-time discussions in a highly polarized social media ecosystem.Entities:
Keywords: COVID‐19; communication dynamics; content analysis; coronavirus; geospatial analysis; network analysis; partisanship; polarization; social media platforms; user behavior modeling
Year: 2020 PMID: 32838229 PMCID: PMC7323338 DOI: 10.1002/hbe2.202
Source DB: PubMed Journal: Hum Behav Emerg Technol ISSN: 2578-1863
Examples of tracked keywords (case insensitive) we used to collect our data and when we begin tracking them
| Keywords | Since | Keywords | Since |
|---|---|---|---|
| Corona | 1/21/20 | Corona virus | 1/21/20 |
| Wuhan | 1/21/20 | COVID‐19 | 2/16/20 |
| Epidemic | 1/2/20 | SARS‐CoV‐2 | 3/6/20 |
| Outbreak | 1/21/20 | Pandemic | 3/12/20 |
| Ncov | 1/21/20 | COVD | 3/12/20 |
Statistics of the full dataset and the U.S. dataset
| Time period | 1/21–4/3 |
|---|---|
|
| |
| No. of total tweets | 87 million |
| No. of (original) tweets | 10 million |
| No. of reply tweets | 4.8 million |
| No. of retweets | 17 million |
| No. of users | 18 million |
| No. of tweets with | 635,000 |
| No. of tweets with | 56 million |
|
| |
| No. of tweets in the United States | 14.5 million |
| No. of users in the United States | 2.3 million |
| No. of tweets in a U.S. state | 12.4 million |
| No. of users in a U.S. state | 1.8 million |
Dataset statistics of the top 10 most active states
| State | Partisanship |
| % |
|---|---|---|---|
| CA | Democratic | 2,007,490 | 16 |
| TX | Republican | 1,268,246 | 10 |
| NY | Democratic | 1,184,178 | 10 |
| FL | Republican | 880,832 | 7 |
| IL | Democratic | 447,082 | 4 |
| PA | Split | 386,996 | 3 |
| OH | Republican | 344,356 | 3 |
| GA | Republican | 343,094 | 3 |
| DC | Democratic | 328,965 | 3 |
| NC | Split | 325,048 | 3 |
Note: is the number of tweets geo‐tagged with the state, and % is the fraction of all tweets geo‐tagged United States.
FIGURE 1The time series of tweet and retweet volume in the United States averaged over a sliding window of 3 days (left y‐axis), and the log‐scaled daily confirmed and death cases in the United States (right y‐axis). Key events are bubbled and described on the bottom
Hashtags with the highest geo‐trending Importance Ratios (IR) in Democratic‐leaning, split control, Republican‐leaning and all states in the United States
| Democratic‐leaning states | Hashtag | Republican‐leaning states | United States (all) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Hashtag |
| IR | Hashtag |
| IR | Hashtag |
| IR | Hashtag |
| IR |
| Seattle | 352.2 | 4.90 | smartnews | 285.2 | 3.98 | florida | 510.3 | 3.97 | pencedemic | 8,957 | 3.44 |
| pencedemic | 421.7 | 3.21 | familiesfirst | 126.4 | 3.01 | fisa | 255.8 | 3.72 | cpac2020 | 6,575 | 3.27 |
| medicareforall | 324.8 | 3.19 | trumpcrash | 157.4 | 2.98 | trump2020 | 727.7 | 3.57 | smartnews | 9,792 | 3.05 |
| familiesfirst | 232.4 | 3.00 | trumpliesaboutcoronavirus | 221.2 | 2.96 | democrats | 643.7 | 3.35 | trump2020 | 15,971 | 2.93 |
| trumpvirus | 1,154.3 | 2.83 | trumpviruscoverup | 137.8 | 2.94 | kag2020 | 482.3 | 3.29 | kag | 17,189 | 2.88 |
| trumpviruscoverup | 244.0 | 2.81 | medtwitter | 118.6 | 2.92 | Oann | 490.0 | 3.24 | usmca | 5,524 | 2.81 |
| demdebate | 415.9 | 2.79 | trumppandemic | 134.6 | 2.86 | stateoftheunion | 229.3 | 3.23 | democrats | 15,122 | 2.81 |
| onevoicel | 337.2 | 2.76 | onevoicel | 219.6 | 2.85 | usmca | 237.3 | 3.20 | trumpvirus | 27,492 | 2.79 |
| covid19us | 502.2 | 2.64 | demdebate | 241.4 | 2.82 | kag | 744.2 | 3.20 | trumpviruscoverup | 6,220 | 2.79 |
| trumppandemic | 240.2 | 2.63 | publichealth | 246.2 | 2.76 | michaelbloomberg | 777.3 | 3.14 | kag2020 | 10,732 | 2.78 |
Note: We color code hashtags that belong to the same category. Red: Trump's 2020 re‐election campaign slogans; Blue: politically‐charged left‐leaning hashtags; Yellow: other politically relevant hashtags. is the average hashtag usage in the given states.
FIGURE 2Daily trajectories of the intra‐ and inter‐state communication ratios with a bootstrapped 95% confidence interval. Blue: the ratio of intra‐state retweets out of retweets consumed by each state. Orange: the ratio of retweets produced from DC out of retweets consumed by each state. Vertical line: day of the first reported death in the United States
The top inter‐ and intra‐state hashtags ranked by and , respectively
| Top inter‐state |
| Top intra‐state |
|
|---|---|---|---|
| #trumpvirus | .91 | #flattenthecurve | .31 |
| #china | .91 | #socialdistancing | .28 |
| #pandemic | .91 | #coronavirus | .18 |
| #hoax | .91 | #washyourhands | .18 |
| #maga | .90 | #publichealth | .17 |
| #americafirst | .89 | #podcast | .17 |
| #trump | .87 | #stopthespread | .16 |
| #dobbs | .86 | #health | .15 |
| #demdebate | .85 | #quarantinelife | .15 |
| #coronavirususa | .85 | #stayhomesavelives | .15 |
FIGURE 3Popularity gain trajectories of the temporal hashtag clusters detected using the dipm‐SC method (Ozer et al., 2020)
The temporal hashtag clusters detected by the dipm‐SC method (Ozer et al., 2020), and the semantically similar sub‐clusters detected by the Louvain method (Blondel et al., 2008)
| Clusters | Hashtags | |
|---|---|---|
| A | 1 |
|
| 2 | #ai #economy #epidemic #freespeech #freezerohedge #health #hiv #iot #oil #oott #outbreak #ukraine | |
| 3 | #censorship #coronaviruscanada #coronavirusjapan #coronavirustruth #flu | |
| B | 1 | #cpac2020 #demdebate #fisa #pencedemic #supertuesday #trumpcrash |
| 2 |
| |
| C | 1 |
#aids #coronavirus #donaldtrump #ebola #foxnews #icymi #onevoice1 #pandemic #podcast #publichealth #washyourhands |
| 2 | #cdc #climatechange #climatecrisis #media #medicareforall #trump #trump2020 | |
| 3 | #business #california #cnn #florida #healthcare #newyork #nyc #science #stocks #tech #vaccine | |
| 4 |
| |
| D | 1 | #coronapocalypse #covidiot #dontbeaspreader #familiesfirst #ppe #trumppandemic |
| 2 |
| |
Note: Only sub‐clusters with at least 4 hashtags are shown. Sub‐clusters of subjective but ideologically unifying hashtags are in bold.
The retweet likelihood ratio of hashtag sub‐clusters (rows) and (columns) (see Equation (3))
|
| A1 | C4 | B2 | D2 |
|---|---|---|---|---|
| Conspiracy (A1) | — | 1.40 | 0.86 | 0.44 |
| Right‐leaning (C4) | 1.16 | — | 1.16 | 0.58 |
| Left‐leaning/neutral (B2) | 0.71 | 1.16 | — | 0.83 |
| Health/prevention (D2) | 1.20 | 1.94 | 2.75 | — |
FIGURE 4The retweet network of users of hashtag sub‐clusters A1, B2, C4, and D2 laid out using ForceAtlas2 (Jacomy, Venturini, Heymann, & Bastian, 2014). The top 3 detected communities are colored accordingly. The red community is composed of mostly right‐leaning users (A1 and C4), and the blue and yellow communities are mostly left‐leaning/neutral users (B2). Users of health/prevention hashtags (D2) are twice as represented in the blue than the red community