| Literature DB >> 34457371 |
Hanjia Lyu1, Junda Wang2, Wei Wu1, Viet Duong2, Xiyang Zhang3, Timothy D Dye4, Jiebo Luo2.
Abstract
Background The current development of vaccines for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is unprecedented. Little is known, however, about the nuanced public opinions on the vaccines on social media. Methods We adopted a human-guided machine learning framework using more than six million tweets from almost two million unique Twitter users to capture public opinions on the vaccines for SARS-CoV-2, classifying them into three groups: pro-vaccine, vaccine-hesitant, and anti-vaccine. After feature inference and opinion mining, 10,945 unique Twitter users were included in the study population. Multinomial logistic regression and counterfactual analysis were conducted. Results Socioeconomically disadvantaged groups were more likely to hold polarized opinions on coronavirus disease 2019 (COVID-19) vaccines, either pro-vaccine ( B = 0.40 , SE = 0.08 , P < 0.001 , OR = 1.49 ; 95 % CI = 1.26 -- 1.75 ) or anti-vaccine ( B = 0.52 , SE = 0.06 , P < 0.001 , OR = 1.69 ; 95 % CI = 1.49 -- 1.91 ). People who have the worst personal pandemic experience were more likely to hold the anti-vaccine opinion ( B = - 0.18 , SE = 0.04 , P < 0.001 , OR = 0.84 ; 95 % CI = 0.77 -- 0.90 ). The United States public is most concerned about the safety, effectiveness, and political issues regarding vaccines for COVID-19, and improving personal pandemic experience increases the vaccine acceptance level. Conclusion Opinion on COVID-19 vaccine uptake varies across people of different characteristics.Entities:
Keywords: COVID-19; Public opinion; Social Media; Twitter; Vaccine hesitancy
Year: 2021 PMID: 34457371 PMCID: PMC8384764 DOI: 10.1016/j.imed.2021.08.001
Source DB: PubMed Journal: Intell Med ISSN: 2667-1026
Figure 1The diagram of data preprocessing procedures.
Labeling scheme for Tweets
| Category | Description |
|---|---|
| Pro-vaccine | i. Claiming that they would take the vaccine once it is available |
| ii. Advocating and supporting vaccine/vaccine-associated entities like vaccine experiment trials | |
| iii. Believing that the vaccine will be the solution to the pandemic | |
| Vaccine-hesitant | i. Claiming that they would like to take the vaccine after the vaccine is proven safe/effective |
| ii. Claiming that they would wait for a while and see whether a vaccine is truly safe/effective | |
| if there is one | |
| iii. Showing worries about the effectiveness of a rushed vaccine | |
| Anti-vaccine | i. Promoting/arguing in favor of conspiracy theory about vaccine/vaccine-associated entities |
| ii. Believing that an effective vaccine would not be invented quickly and help overcome | |
| the pandemic | |
| iii. Believing that a covid-19 vaccine is dangerous for whatever reasons and would not take it | |
| even though the commenters claim that they are not anti-vaccine | |
| Irrelevant | i. Vaccine News. No written opinion from the commenters |
| ii. Including vaccine and the commenters opinions, but the focus is something else | |
| (i.e., insurance, politics, personal life experience, economics, emotional complaints, etc.) | |
| iii. Comments/questions on vaccines/vaccine-associated entities but with unclear meanings |
Figure 2Distributions of different categories of the original and final training corpora.
Figure 3Performance of of each iteration.
Performance of the four-class XLNet model
| Class | Precision | Recall | F1-score |
|---|---|---|---|
| Irrelevant | 0.45 | 0.84 | 0.59 |
| Pro-vaccine | 0.78 | 0.52 | 0.62 |
| Vaccine-hesitant | 0.77 | 0.54 | 0.64 |
| Anti-vaccine | 0.79 | 0.61 | 0.69 |
| Overall | 0.70 | 0.63 | 0.63 |
Figure 4Counterfactual analyses illustrate the importance of politics, safety and effectiveness factor indicators, and personal pandemic experience.
Figure 510 topics extracted from the tweets with the top 30 keywords.
Descriptive statistics of the factor indicators
| Variables | Number | Mean | SD | Min | Max |
|---|---|---|---|---|---|
| Politics | 10,945 | 0.2512 | 0.4337 | 0 | 1 |
| Safety and effectiveness | 10,945 | 0.1801 | 0.3843 | 0 | 1 |
Figure 6(a) The proportions of the opinion groups from September 28 to November 4, 2020. (b) Number of Twitter users from September 28 to November 4, 2020. The data of October 5, 2020 are missing due to a data collection issue.
Figure 7State-level public opinions about potential COVID-19 vaccines. The numbers in parentheses stand for the sizes of the study populations.
Figure 8The percentages of the pro-vaccine groups of the national average (US), Nevada (NV), Tennessee (TN), and Washington (WA).
Descriptive statistics and the bi-variate correlations
| Variables | Mean | SD | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1. Gender (0 = male, | 0.46 | 0.50 | |||||||||||||||||||
| 2. Age (years) | 39.89 | 14.69 | |||||||||||||||||||
| 3. Verified (0 = no, | 0.04 | 0.20 | |||||||||||||||||||
| 4. Twitter history (months) | 91.30 | 43.47 | |||||||||||||||||||
| 5. # Followers | 1.60 | 1.63 | |||||||||||||||||||
| 6. # Friends | 1.95 | 1.25 | |||||||||||||||||||
| 7. # Listed memberships | 0.93 | ||||||||||||||||||||
| 8. # Favorites | 4.17 | 1.95 | |||||||||||||||||||
| 9. # Statuses | 4.09 | 1.43 | |||||||||||||||||||
| 10. Higher-Income | 0.00 | 0.05 | .01 | ||||||||||||||||||
| 11. Lower-Income | 0.76 | 0.43 | |||||||||||||||||||
| 12. Religious (0 = no, | 0.04 | 0.19 | .01 | .02 | .02 | ||||||||||||||||
| 13. Having kids (0 = no, | 0.12 | 0.32 | .01 | ||||||||||||||||||
| 14. Following Trump | 0.11 | 0.31 | |||||||||||||||||||
| 15. Following Biden | 0.17 | 0.38 | .00 | .01 | |||||||||||||||||
| 16. Rural (0 = no, | 0.19 | 0.40 | .00 | ||||||||||||||||||
| 17. Suburban (0 = no, | 0.14 | 0.35 | |||||||||||||||||||
| 18. Pandemic experience | 0.06 | 0.80 | .01 | .00 | |||||||||||||||||
| 19. Non-pandemic | 0.62 | 0.75 | .01 | .02 | .02 | .00 | .01 | ||||||||||||||
| 20. Pandemic severity | 0.01 | 0.00 | - | .01 | .01 |
Note: * . ** . # The numbers of followers, friends, listed memberships, favorites, statuses are normalized by the months of Twitter history and log-transformed.
Multinomial logistic regression outputs for the opinion on potential COVID-19 vaccines against demographics and other variables of interest
| Predictor | Anti-vaccine | Pro-vaccine | ||||
|---|---|---|---|---|---|---|
| OR (95% CI) | OR (95% CI) | |||||
| Intercept | 0.26 | 0.20 | ||||
| Age (years) | 0.00 | 0.00 | 1.00 (1.00, 1.01) | 0.00 | 1.01 (1.01, 1.02) | |
| Twitter history (months) | 0.00 | 0.00 | 1.00 (1.00, 1.00) | 0.001 | 0.997 (0.996, 0.999) | |
| # Followers | 0.04 | 1.32 (1.22, 1.42) | 0.03 | 1.09 (1.02, 1.16) | ||
| # Friends | 0.04 | 0.83 (0.77, 0.90) | 0.03 | 0.93 (0.88, 0.99) | ||
| # Listed memberships | 0.06 | 0.53 (0.47, 0.60) | 0.05 | 1.10 (1.01, 1.20) | ||
| # Favorites | 0.03 | 1.12 (1.06, 1.19) | 0.02 | 1.04 (1.01, 1.08) | ||
| # Statuses | 0.03 | 1.12 (1.06, 1.19) | 0.02 | 0.94 (0.90, 0.99) | ||
| Pandemic experience (sentiment) | 0.04 | 0.84 (0.77, 0.90) | 0.03 | 1.24 (1.16, 1.32) | ||
| Non-pandemic experience (sentiment) | 0.04 | 0.96 (0.89, 1.04) | 0.03 | 1.14 (1.07, 1.22) | ||
| Pandemic severity perception | 8.13 | 0.00 (0.00, 2.58) | 6.59 | 0.00 (0.00, 0.00) | ||
| Female | 0.06 | 0.78 (0.69, 0.88) | 0.05 | 0.63 (0.57, 0.69) | ||
| Verified user | 0.27 | 0.54 (0.32, 0.91) | 0.14 | 0.85 (0.65, 1.12) | ||
| Higher-income | 5.00e+36 | 0.00 (0.00, Inf) | 0.47 | 0.43 | 1.60 (0.70, 3.68) | |
| Lower-income | 0.08 | 1.49 (1.26, 1.75) | 0.06 | 1.69 (1.49, 1.91) | ||
| Religious | 0.17 | 2.10 (1.52, 2.91) | 0.15 | 1.45 (1.07, 1.95) | ||
| Having kids | 0.10 | 0.90 (0.74, 1.09) | 0.00 | 0.08 | 1.00 (0.86, 1.15) | |
| Following Trump | 0.10 | 1.51 (1.26, 1.83) | 0.06 | 0.08 | 1.06 (0.90, 1.25) | |
| Following Biden | 0.10 | 0.29 (0.24, 0.36) | 0.06 | 0.71 (0.63, 0.80) | ||
| Rural | 0.08 | 1.19 (1.01, 1.39) | 0.07 | 0.07 | 1.07 (0.94, 1.22) | |
| Suburban | 0.09 | 1.20 (1.01, 1.43) | 0.11 | 0.07 | 1.12 (0.97, 1.29) | |
| Chi-square | ||||||
| 40 | ||||||
| 20,171.90 | ||||||
| McFadden’s pseudo | 0.06 | |||||
| Sample size | 10,945 | |||||
Note: * . ** . *** . The vaccine hesitant group is selected as the reference category. # The numbers of followers, friends, listed memberships, favorites, statuses are normalized by the months of Twitter history and log-transformed.