| Literature DB >> 35382528 |
Yi Zhao1, Haixu Xi1, Chengzhi Zhang1.
Abstract
Coronavirus disease 2019 (COVID-19) pandemic-related information are flooded on social media, and analyzing this information from an occupational perspective can help us to understand the social implications of this unprecedented disruption. In this study, using a COVID-19-related dataset collected with the Twitter IDs, we conduct topic and sentiment analysis from the perspective of occupation, by leveraging Latent Dirichlet Allocation (LDA) topic modeling and Valence Aware Dictionary and sEntiment Reasoning (VADER) model, respectively. The experimental results indicate that there are significant topic preference differences between Twitter users with different occupations. However, occupation-linked affective differences are only partly demonstrated in our study; Twitter users with different income levels have nothing to do with sentiment expression on covid-19-related topics.Entities:
Keywords: COVID-19; Twitter; occupational differences; sentiment analysis; topic discovery
Year: 2022 PMID: 35382528 PMCID: PMC8969477 DOI: 10.2478/dim-2020-0032
Source DB: PubMed Journal: Data Inf Manag ISSN: 2543-9251
The Distribution of Occupations of Twitter Users
| Income Level | Occupation | Occupation Abbreviations | Number of tweets |
|---|---|---|---|
| High | Computer and Information Research Scientists | CIRS | 1,652 |
| Marketing Managers | MM | 1,700 | |
| Dentists, General | DEN | 1,835 | |
| Medium | Management Analysts | MA | 1,909 |
| Business Teachers, Postsecondary | BTP | 1,736 | |
| Financial Analysts | FA | 1,841 | |
| Low | Farmworkers, Farm, Ranch, and Aquacultural Animals | FFRAA | 1,876 |
| Production Workers, All Other | PW | 1,750 | |
| Landscaping and Groundskeeping Workers | LGW | 1,988 |
Figure 1Coherence measurement for bag of words (BOW) and term frequency–inverse document frequency (TF-IDF).
Topics in COVID-19-Associated Tweets
| Topic | Themes | Top 15 most relevant words | Number of tweets |
|---|---|---|---|
| 0 | Preparation for reopening | 35,336 | |
| 1 | President's lies about COVID-19 | 40,838 | |
| 2 | Coronavirus new cases and deaths | 50,863 | |
| 3 | * | 35,531 | |
| 4 | Free online support | 49,828 | |
| 5 | * | 107,280 | |
| 6 | Protests against the stay-at-home order | 25,435 | |
| 7 | * | 62,833 | |
| 8 | * | 29,162 | |
| 9 | The risk caused by COVID-19 | 54,488 | |
| 10 | Measures to slow the spread of COVID-19 | 48,237 | |
| 11 | Research on the vaccine and treatment | 39,989 | |
| 12 | Virus misinformation and fake news | 21,210 | |
| 13 | * | 21,657 |
Note: * indicates that it is difficult to assign a specific theme to the topic and will not be analyzed in a subsequent section.
Figure 2Topic distribution of Twitter users engaged in different occupations.
Figure 3Sentiment distribution on the topics 0, 1, 2, and 4.
Figure 4Sentiment distribution on the topics 6, 9, 10, 11, and 12.