| Literature DB >> 34721721 |
Ruchi Mittal1, Amit Mittal2, Ishan Aggarwal3.
Abstract
This study aims to conduct text mining of affective valence of the sentiments generated on social media during the COVID-19 and measure their association with different outcomes of the disease. 50,000 tweets per day over 23 days during the pandemic were extracted using the VADER sentiment analysis tool. Overall, tweets could effectively be classified in terms of polarity, i.e., "positive," "negative" and "neutral" sentiments. Furthermore, on a day-to-day basis, the study identified a positive and significant relationship between COVID-19-related (a) global infections and negative tweets, (b) global deaths and negative tweets, (c) recoveries and negative tweets, and (d) recoveries and positive tweets. No significant association could be found between (e) infections and positive tweets and (f) deaths and positive tweets. Furthermore, the statistical analysis also indicated that the daily distribution of tweets based on polarity generates three distinct and significantly different numbers of tweets per category, i.e., positive, negative and neutral. As per the results generated through sentiment analysis of tweets in this study, the emergence of "positive" tweets in such a gloomy pandemic scenario shows the inherent resilience of humans. The significant association between news of COVID-19 recoveries and positive tweets seems to hint at a more optimistic scenario whenever the pandemic finally comes to an end or is controlled. Such public reactions-for good-have the potential to go viral and influence several others, especially those who are classified as "neutral" or fence-sitters.Entities:
Keywords: Affective valence; COVID-19; Coronavirus; Pandemic; SARS-CoV-2; Sentiment analysis; Twitter
Year: 2021 PMID: 34721721 PMCID: PMC8548272 DOI: 10.1007/s13278-021-00828-x
Source DB: PubMed Journal: Soc Netw Anal Min
Pandemics in the age of Twitter.
Source: Authors compilation
| Disease | Study |
|---|---|
| Ebola | Oyeyemi et al. ( |
| Swine Flu (H1N1) | Ahmed et al. ( |
| Influenza-like illness (ILI) | Velardi et al. ( |
| Bird Flu (H7N9) | Hellsten et al. ( |
| MERS | Fung et al. ( |
| SARS | St Louis and Zorlu ( |
| ZIKA | Pruss et al. ( |
| COVID-19 (SARS-CoV-2) | Jahanbin and Rahmanian ( |
| Escherichia coli outbreak | Diaz-Aviles and Stewart ( |
| Measles | Tang et al. ( |
| Yellow fever | Ortiz-Martínez and Jiménez-Arcia ( |
Timeline of the COVID-19 outbreak.
Source: https://www.worldometers.info/coronavirus/
| Event | Date |
|---|---|
| First COVID-19 case reported in China | December 1, 2019 |
| First COVID-19 case reported outside China | January 13, 2020 (in Thailand) |
| No new infection reported from Wuhan—ground zero | March 19, 2020 |
| Epicenter shifts from China to Europe | March 10, 2020 |
| WHO declares COVID-19 as a global pandemic | March 11, 2020 |
| Top five countries with the highest number of reported COVID-19 infections: | |
| (1) China, (2) Italy, (3) USA, (4) Spain, (5) Germany | March 26, 2020 |
| (1) USA, (2) Italy, (3) China, (4) Spain, (5) Germany | March 27, 2020 |
Fig. 1Data analysis framework
Random tweets extracted from March 16, 2020–March 23, 2020
| Category of tweets | Total tweets (Percentage of Total) |
|---|---|
| Total tweets captured | 70,095 |
| Positive tweets | 23,057 (33%) |
| Negative tweets | 17,471 (30%) |
| Neutral tweets | 21,945 (37%) |
Description and samples of tweet categories
| Positive tweets | “We are in this together though apart. Helping people remain at home will save lives and keep people safe…” “USA stands by Italy during these trying times Share your Support for our Italian friends, they are our colleagues and friends…” “Best wishes for a speedy recovery and take care South Korea and have a great success through testing…” |
| Negative tweets | “The will be the scapegoat The US economy was headed for a crash anyway due to the reckless and incompetent White House…” “It is expected at least the same amount of people to die to this vile rancid virus….The least we can do whilst we still can…” “IRONY IS DEAD Mike Pompeo just criticized China for hiding the Coronavirus from us” |
| Neutral tweets | “US virologist on COVID- This will end but expect more” “ XYD Clinic Labs ramp up for round-the-clock COVID- testing” “Well…. there goes my wedding..” |
Day-wise (23 days in March 2020), negative and positive tweets and the number of COVID-19 infected people across the world, related deaths and recoveries (in China).
Source: for (a) and (b): Extracted by the authors; for (c) and (d) https://covid.ourworldindata.org/data/ecdc/total_cases.csv accessed on 24.03.2020 and for (e)
| Date | Positive tweets (a) | Negative tweets (b) | Neutral tweets (c) | Infections (c) | Deaths (d) | Recoveries (e) |
|---|---|---|---|---|---|---|
| 1-Mar-20 | 18,420 | 13,966 | 17,214 | 87,024 | 3050 | 44,462 |
| 2-Mar-20 | 12,020 | 11,917 | 25,116 | 89,068 | 3117 | 47,204 |
| 3-Mar-20 | 10,520 | 12,354 | 11,234 | 90,663 | 3202 | 49,856 |
| 4-Mar-20 | 10,113 | 10,250 | 29,540 | 93,076 | 3285 | 52,045 |
| 5-Mar-20 | 15,889 | 12,281 | 14,630 | 95,315 | 3387 | 53,726 |
| 6-Mar-20 | 12,030 | 14,281 | 13,687 | 98,171 | 3494 | 55,404 |
| 7-Mar-20 | 16,346 | 15,366 | 14,286 | 102,132 | 3599 | 57,065 |
| 8-Mar-20 | 10,500 | 14,500 | 13,004 | 105,823 | 3827 | 58,600 |
| 9-Mar-20 | 10,132 | 13,981 | 25,843 | 109,694 | 4025 | 59,897 |
| 10-Mar-20 | 20,080 | 16,010 | 13,820 | 114,231 | 4296 | 61,475 |
| 11-Mar-20 | 10,850 | 14,516 | 12,544 | 118,609 | 4628 | 62,793 |
| 12-Mar-20 | 15,892 | 19,250 | 13,658 | 125,496 | 4981 | 64,111 |
| 13-Mar-20 | 11,003 | 15,421 | 14,110 | 133,848 | 5429 | 65,541 |
| 14-Mar-20 | 20,112 | 12,118 | 17,010 | 143,223 | 5833 | 66,911 |
| 15-Mar-20 | 16,524 | 14,992 | 15,444 | 151,363 | 6519 | 67,749 |
| 16-Mar-20 | 12,902 | 15,258 | 21,745 | 167,414 | 7161 | 68,679 |
| 17-Mar-20 | 17,032 | 15,772 | 17,123 | 180,159 | 7978 | 69,601 |
| 18-Mar-20 | 16,838 | 15,658 | 17,010 | 194,909 | 8951 | 70,420 |
| 19-Mar-20 | 29,069 | 15,204 | 3445 | 213,254 | 10,031 | 71,150 |
| 20-Mar-20 | 27,445 | 16,753 | 6407 | 242,473 | 11,387 | 71,740 |
| 21-Mar-20 | 16,490 | 14,594 | 17,900 | 271,228 | 13,013 | 72,244 |
| 22-Mar-20 | 17,620 | 16,200 | 15,010 | 305,275 | 14,641 | 72,703 |
| 23-Mar-20 | 12,755 | 18,569 | 17,009 | 338,303 | 16,514 | 73,159 |
The graphical representation are described as per Figs. 2 and 3
Fig. 2COVID-19 outcomes
Fig. 3Tweet categories
Correlation matrix: association between tweet category and COVID-19-related cases
| Infections | Deaths | Recoveries | Positive tweets | Negative tweets | Neutral tweets | |
|---|---|---|---|---|---|---|
| Infections | 1 | |||||
| Deaths | 0.996** | 1 | ||||
| Recoveries | 0.817** | 0.814** | 1 | |||
| Positive tweets | 0.379 | 0.388 | 0.417* | 1 | ||
| Negative tweets | 0.559** | 0.555** | 0.628** | 0.292 | 1 | |
| Neutral tweets | − 0.084 | − 0.086 | − 0.213 | − 0.389 | − 0.435* | 1 |
*Correlation is significant at the 0.05 level (two-tailed) **correlation is significant at the 0.01 level (two-tailed)—technique used is Pearson correlation coefficient
Descriptive statistics
| Variables | Minimum | Maximum | Mean | Std. deviation | |
|---|---|---|---|---|---|
| Positive count | 23 | 10,113 | 29,069 | 15,677 | 5118.81 |
| Negative count | 23 | 10,250 | 19,250 | 14,748 | 2071.08 |
| Neutral count | 23 | 3445 | 29,540 | 15,947 | 5737.51 |
ANOVA for COVID outcomes and tweet counts
| COVID outcomes | Sum of squares | Df | Mean square | Sig. | |
|---|---|---|---|---|---|
| Between groups | 657,357,828.193 | 2 | 328,678,914.096 | 0.056 | 0.046** |
| Within groups | 117,913,285,052.764 | 20 | 5,895,664,252.638 | ||
| Total | 118,570,642,880.957 | 22 | |||
| Between groups | 2,052,770.749 | 2 | 1,026,385.374 | 0.059 | 0.034** |
| Withi groups | 348,784,384.556 | 20 | 17,439,219.228 | ||
| Total | 350,837,155.304 | 22 | |||
| Between groups | 14,791,553.359 | 2 | 7,395,776.680 | 0.088 | 0.916*** |
| Within groups | 1,676,724,071.597 | 20 | 83,836,203.580 | ||
| Total | 1,691,515,624.957 | 22 | |||
***Significant at 1.0% level, **Significant at 0.05% level
| Exhibit 1: Stages of processing a raw tweet (example) |
|---|
| Original tweet: |
| We stand by Italy during these trying times. Share your Support for our Italian friends, They are our colleagues, friends, and family. |
| Tweet content extracted and saved: |
| We stand by Italy during these trying times. Share your Support for our Italian friends, They are our colleagues, friends, and family.#COVID-19 #WeStandWithItaly |
| Preprocessing of original tweet: |
| We stand by Italy during these trying times. Share your Support for our Italian friends, They are our colleagues, friends, and family. #COVID-19 #WeStandWithItaly |
| We stand by Italy during these trying times. Share your Support for our Italian friends, They are our colleagues, friends, and family |
| We stand by Italy during these trying times share your support for our |
| Italian friends they are our colleagues friends and family |
| Cleaned tweet: |
| “We stand by Italy during these trying times share your support for our Italian friends they are our colleagues, friends, and family…” |