| Literature DB >> 35259181 |
Haiyan Yu1, Ching-Chi Yang2, Ping Yu3, Ke Liu1.
Abstract
Coronavirus disease 2019 (COVID-19) has triggered an enormous number of discussion topics on social media Twitter. It has an impact on the global health system and citizen responses to the pandemic. Multiple responses (replies, favorites, and retweets) reflect the followers' attitudes and emotions towards these tweets. Twitter data such as these have inspired substantial research interest in sentiment and social trend analyses. To date, studies on Twitter data have focused on the associational relationships between variables in a population. There is a need for further discovery of causality, such as the influence of sentiment polarity of tweet response on further discussion topics. These topics often reflect the human perception of COVID-19. This study addresses this exact topic. It aims to develop a new method to unveil the causal relationships between the sentiment polarity and responses in social media data. We employed sentiment polarity, i.e., positive or negative sentiment, as the treatment variable in this quasi-experimental study. The data is the tweets posted by nine authoritative public organizations in four countries and the World Health Organization from December 1, 2019, to May 10, 2020. Employing the inverse probability weighting model, we identified the treatment effect of sentiment polarity on the multiple responses of tweets. The topics with negative sentiment polarity on COVID-19 attracted significantly more replies (69±49) and favorites (688±677) than the positive tweets. However, no significant difference in the number of retweets was found between the negative and positive tweets. This study contributes a new method for social media analysis. It generates new insight into the influence of sentiment polarity of tweets about COVID-19 on tweet responses.Entities:
Mesh:
Year: 2022 PMID: 35259181 PMCID: PMC8903302 DOI: 10.1371/journal.pone.0264794
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistics of the Twitter accounts of the nine studied public organizations.
| ID | Entity | Org. | Location | Joined | Tweets1 | Following | Followers | Tweets2 |
|---|---|---|---|---|---|---|---|---|
| 1 | CDCgov | USCDC | US | May-10 | 26.6K | 267 | 2.6M | 441 |
| 2 | ChinaCDC | CNCDC | China | Jan-20 | 112 | 35 | 216 | 85 |
| 3 | healthgovau | AUDoH | Australian | Mar-10 | 15K | 135 | 79.6K | 457 |
| 4 | NHSEngland | UKNHS | UK | Apr-12 | 42.8K | 2379 | 415K | 313 |
| 5 | WHO | WHO | Inter-national | May-09 | 12.8K | 1744 | 74.5K | 2039 |
| 6 | australian | Australian | Australia | Oct-07 | 242.1K | 545 | 717.9K | 1084 |
| 7 | bbcworld | BBC | UK | Feb-07 | 296K | 70 | 25.3M | 1411 |
| 8 | ChinaDaily | CDaily | China | Nov-09 | 97K | 490 | 3.95M | 6394 |
| 9 | cnn | CNN | US | Feb-07 | 248K | 1107 | 42.1M | 7694 |
Notes: US: United States; UK: United Kingdom; NA: Missing; K denotes thousand; M denotes million.
WHO: World Health Organization; Org.: Organization.
Joined: Date of the organizational account joining the Twitter platform; May-10 means May 2010.
Tweets 1: Number of total tweets.
Tweets 2: Number of Tweets during the study period.
ChinaCDC: China CDC Weekly, Platform of Chinese Center for Disease Control and Prevention.
Variable definitions and measurements.
| Variables | Definitions | Measurements |
|---|---|---|
| Dependent variable (multiple responses) | ||
| RP | Mean of replies | The average number of replies for an organization’s weekly tweet. |
| FV | Mean of favorites | The average number of favorites for an organization’s weekly tweet. |
| RT | Mean of retweets | The average number of retweets for an organization’s weekly tweet. |
| Objective | ||
|
| Differential effect of treatment | DET = mean of |
|
| ||
|
| Sentiment polarity | A dummy variable indicating the status of the sentiment score. 1 indicates the positive score, otherwise 0. |
| Covariates (X) | ||
|
| Number of tweets | The number of an organization’s weekly tweets. |
|
| Length of Text | The total length of an organization’s weekly tweets. |
|
| Median frequency | Median frequency of terms in an organization’s weekly tweets. |
|
| Number of items | The total number of terms in an organization’s weekly tweet. |
|
| Highest frequency | Highest frequency of terms in an organization’s weekly tweets. |
|
| Confirmed cases | The number of weekly confirmed COVID-19 cases in an organization’s registered country, and total number of worldwide cases for WHO. |
|
| Number of deaths | The number of weekly deaths due to COVID-19 in an organization’s registered country, and total number of worldwide deaths for WHO. |
|
| Indicator of organizations | OG = 0 indicating the entity is a government organization; OG = 1 indicating a news agency. |
Notes: Unit of analysis for each variable is the weekly count data (from last Saturday to this Sunday) of a Twitter account.
Terms: words in a tweet.
Fig 1The study framework.
Fig 2Weekly number of COVID-19 confirmed cases and deaths.
Statistics of the Twitter accounts of nine authoritative public organizations.
| Variable | Name | Min. | 1Q | Median | Mean | 3Q | Max. |
|---|---|---|---|---|---|---|---|
| SS | Sentiment score | -232 | -9.25 | 0 | 24.2 | 15.5 | 900 |
| RP | Mean of replies | 0 | 0 | 10 | 37.7 | 54.8 | 303 |
| FV | Mean of favorites | 0 | 0 | 60.6 | 363 | 581 | 4305 |
| RT | Mean of retweets | 0 | 0.21 | 31.1 | 174 | 258 | 2667 |
| NT | Number of tweets | 0 | 1 | 24.5 | 92.2 | 108 | 757 |
| LT | Length of Text | 0 | 3.75 | 459.5 | 1576 | 1407 | 12639 |
| MF | Median frequency | 0 | 0.75 | 1 | 1.03 | 1 | 7 |
| NI | Number of items | 0 | 3.75 | 195.5 | 567 | 708 | 3552 |
| HF | Highest frequency | 0 | 0.75 | 28 | 83.4 | 78.5 | 714 |
| CC | Confirmed cases | 0 | 0 | 92 | 38424 | 10084 | 619634 |
| ND | Number of deaths | 0 | 0 | 1.5 | 2602 | 428 | 50045 |
| OG | Indicator of organizations | 0 | 1 | 1 | 0.587 | 1 | 1 |
Note: Q: quarter. The aggregated data has 216 (= 9*24) rows. Each row represent the statistics of an organization’s weekly tweets or cases. The 24 weeks are from Decision 1, 2019 to May 10, 2020.
Fig 3Sentiment score, their density and scatter plots of responses to nine organizations’ weekly tweets.
Fig 4Number of responses (replies, favorites, retweets) to positive and negative tweets from the five government organizations and four news agencies.
Fig 5Distributions of the balancing score and case weights.
Fig 6The differential effect of treatment on the responses (replies, favorites, and retweets) before and after justification for the covariates.
The larger the absolute value of the treatment effect is, the larger the size of sentiment polarity’s impact is on the responses of the weekly tweets. The treatment effects represent the mean difference in responses to the tweets with negative sentiment and those responses to the tweets with positive sentiment from the five government organizations and four news agencies. pooled data = government organizations + news agencies.
Fig 7Stability of the differential effect of treatment (linear regression of sentiment score on multiple responses).
Gov Org: Government organization.