| Literature DB >> 33536695 |
Keke Hou1, Tingting Hou2, Lili Cai3.
Abstract
The COVID-19 epidemic is influencing global population. Social media has become important platforms to acquire and exchange information during the outbreak of COVID-19. This study explores public attention on social media. Popular Weibo texts related to COVID-19 with "coronavirus" and "pneumonia" as the keywords during December 27, 2019 and May 31, 2020 were collected in our study for public attention analysis. By combining data mining and text analysis, the public attention level trend in different stages were presented. Then a correlation analysis between public attention level and COVID-19 related cases number, topic analysis, and sentiment analysis were conducted. Significant positive correlation between public attention level and COVID-19 related cases number was identified. Based on Latent Dirichlet Allocation model, topic extraction was implemented in different stages and 41 topics were identified totally. For a comprehensive understanding of public emotions, sentiment analysis was performed. This study provides valuable lessons for public response to COVID-19.Entities:
Keywords: COVID-19; Correlation analysis; Public attention level; Sentiment analysis; Topics analysis
Year: 2021 PMID: 33536695 PMCID: PMC7843112 DOI: 10.1016/j.paid.2021.110701
Source DB: PubMed Journal: Pers Individ Dif ISSN: 0191-8869
Fig. 1The process of topic extraction.
Fig. 2The overall trend of public attention level related to COVID-19 popular Weibo.
Fig. 3The trend of public attention level related to COVID-19 popular Weibo in different stages.
The descriptive statistics.
| Variables | Minimum value | Maximum value | Mean value | Standard deviation | Number (N) |
|---|---|---|---|---|---|
| Public attention level | 0 | 22,566,518 | 2,882,824.439 | 2,854,732.627 | 157 |
| New confirmed cases number | 0 | 15,152 | 532.166 | 1510.627 | |
| New suspected cases number | 0 | 5328 | 625.070 | 1358.127 | |
| New deaths number | 0 | 254 | 21.943 | 40.531 |
Correlations between public attention level and COVID-19 related cases number.
| Variables | Correlation coefficient | |
|---|---|---|
| Public attention level | New confirmed cases number | 0.823⁎⁎⁎ |
| New suspected cases number | 0.873⁎⁎⁎ | |
| New deaths number | 0.804⁎⁎⁎ | |
| Number (N) | 157 | |
| Public attention level | New confirmed cases number | 0.706⁎⁎⁎ |
| New suspected cases number | 0.711⁎⁎⁎ | |
| New deaths number | 0.673⁎⁎⁎ | |
| Number (N) | 56 | |
| Public attention level | New confirmed cases number | 0.396⁎⁎ |
| New suspected cases number | 0.680⁎⁎⁎ | |
| New deaths number | 0.696⁎⁎⁎ | |
| Number (N) | 68 | |
Note: ***p < 0.001, **p < 0.01 (2-tailed).
Correlations among three COVID-19 related cases number.
| Variables | Correlation coefficient | |
|---|---|---|
| New confirmed cases number | New suspected cases number | 0.905⁎⁎⁎ |
| New confirmed cases number | New deaths number | 0.850⁎⁎⁎ |
| New suspected cases number | New deaths number | 0.903⁎⁎⁎ |
| Number (N) | 157 | |
Note: ***p < 0.001 (2-tailed).
Correlations between daily popular Weibo texts number and other variables.
| Variables | Correlation coefficient | |
|---|---|---|
| Popular Weibo texts number | Public attention level | 0.792*** |
| New confirmed cases number | 0.765*** | |
| New suspected cases number | 0.820*** | |
| New deaths number | 0.795*** | |
| Number (N) | 157 | |
Note: ***p < 0.001 (2-tailed).
The correlations among three indicators of public attention level and new cases number.
| Variables | Correlation coefficient | |
|---|---|---|
| (Daily) comments number | (Daily) retweeting number | 0.928*** |
| (Daily) comments number | (Daily) thump-up number | 0.975*** |
| (Daily) retweeting number | (Daily) thump-up number | 0.921*** |
| (Daily) comments number | New confirmed cases number | 0.823*** |
| New suspected cases number | 0.875*** | |
| New deaths number | 0.820*** | |
| (Daily) retweeting number | New confirmed cases number | 0.853*** |
| New suspected cases number | 0.890*** | |
| New deaths number | 0.814*** | |
| (Daily) thump-up number | New confirmed cases number | 0.815*** |
| New suspected cases number | 0.865*** | |
| New deaths number | 0.796*** | |
| Number (N) | 157 | |
Note: ***p < 0.001 (2-tailed).
Correlations between COVID-19 related cases number and lag value of public attention level.
| Variables | New confirmed cases number | New suspected cases number | New deaths number |
|---|---|---|---|
| Lag 0 day | 0.823⁎⁎⁎ | 0.873⁎⁎⁎ | 0.804⁎⁎⁎ |
| Lag 1 day | 0.837⁎⁎⁎ | 0.845⁎⁎⁎ | 0.801⁎⁎⁎ |
| Lag 2 day | 0.835⁎⁎⁎ | 0.822⁎⁎⁎ | 0.800⁎⁎⁎ |
| Lag 3 day | 0.832⁎⁎⁎ | 0.809⁎⁎⁎ | 0.777⁎⁎⁎ |
| Lag 4 day | 0.807⁎⁎⁎ | 0.772⁎⁎⁎ | 0.767⁎⁎⁎ |
| Lag 5 day | 0.778⁎⁎⁎ | 0.746⁎⁎⁎ | 0.753⁎⁎⁎ |
| Lag 6 day | 0.760⁎⁎⁎ | 0.722⁎⁎⁎ | 0.747⁎⁎⁎ |
| Number (N) | 157 | ||
Note: ***p < 0.001 (2-tailed).
The COVID-19 related topics in the first stage (December 27, 2019- January 19, 2020).
| Topic name | Rate (%) | LDA keywords | |
|---|---|---|---|
| 1 | Human-to-human transmission in Wuhan | 35.08 | Coronavirus, novel, Wuhan, human-to-human, pneumonia, cases, detection, infect |
| 2 | Epidemic situation in Wuhan | 18.19 | Cases, pneumonia, coronavirus, infect, detection, hospital discharge, newly increased, Wuhan |
| 3 | Cases have been detected in Japan | 13.4 | Coronavirus, novel, Japan, pneumonia, cases, patients, detection, infect |
| 4 | Cases have been detected in Thailand | 9.26 | novel, cases, Thailand, coronavirus, find, infect, pneumonia |
| 5 | Unidentified coronavirus found in Wuhan | 8.93 | Coronavirus, pneumonia, patients, cases, find, infect, Wuhan, unknown |
| 6 | Novel coronavirus was preliminarily determined controllable | 8.71 | Pneumonia, coronavirus, unknown, epidemic, novel, preliminarily, controllable, determined |
| 7 | Novel coronavirus symptoms | 6.43 | Pneumonia, cases, Wuhan city, coronavirus, ages, fever, detection, patients |
The COVID-19 related topics in the second stage (January 20–February 20, 2020).
| Topic name | Rate (%) | LDA keywords | |
|---|---|---|---|
| 1 | Treatment condition | 25.04 | Clinic, treatment, pneumonia, quarantine, detection, treat and cure, nucleic acid, traditional Chinese medicine |
| 2 | Epidemic prevention and control | 16.56 | Epidemic, prevention and control, achieve, epidemic prevention, community, face masks, wear, alcohol |
| 3 | Notification of epidemic situation | 13.12 | Cases, confirmed, newly increased, hospital discharge, accumulation, novel, cured, remain in hospital for observation |
| 4 | Global attention on the epidemic in China | 11.69 | Epidemic, China, coronavirus, World Health Organization, spread, global, international |
| 5 | The medical team supported Wuhan | 11.62 | China, epidemic, Wuhan, Hubei, fight, medical team, support, urgency |
| 6 | Novel coronavirus scientific research | 8.74 | Coronavirus, novel, research, academician, laboratory, Zhong Nanshan, SARS, vaccine |
| 7 | Front-line clinical staff | 6.84 | Epidemic, doctor, nurses, pneumonia, medical worker, hospitals, front-line, family |
| 8 | Medical resources | 3.22 | Detection test kits, reagent, hospital, novel, coronavirus, supplies, detection, medical |
| 9 | Route of transmission | 3.18 | Transmission, disinfection, contact, aerosol, droplet, ventilation, face masks, wash hands |
The COVID-19 related topics in the third stage (February 21–March 17, 2020).
| Topic name | Rate (%) | LDA keywords | |
|---|---|---|---|
| 1 | Notification of epidemic situation | 15.01 | Cases, confirmed, newly increased, pneumonia, cured, hospital discharge, death, medical observation |
| 2 | Fight against epidemic | 14.41 | Epidemic, pneumonia, novel coronavirus, fight, anti-epidemic, nation, news, prevention and control |
| 3 | Prevention of overseas imported cases | 13.82 | Epidemic, prevention and control, overseas, people, detection, measures, enter the country, nucleic acid testing |
| 4 | The spread of epidemic | 13.39 | Epidemic, China, the United States, global, novel coronavirus, nation, pneumonia, spread, Europe |
| 5 | Epidemic in other countries | 12.72 | Novel coronavirus, infection, pneumonia, Iran, Britain, Spain, Ministry of Health, detection |
| 6 | Epidemic situation in Wuhan | 8.64 | Novel coronavirus, pneumonia, Wuhan, virus, News, Research, Zhong Nanshan, specialists |
| 7 | Epidemic situation in the United States | 7.39 | the United States, President, novel coronavirus, face masks, epidemic, pneumonia, state of emergency, infection |
| 8 | Epidemic situation in Germany | 5.61 | Novel coronavirus, pneumonia, Germany, competition, sports, epidemic, World Health Organization |
| 9 | Epidemic situation in Korea | 4.89 | Korea, novel coronavirus, pneumonia, confirmed, infection, Daegu, daily, church |
| 10 | Epidemic situation in Italy | 4.10 | Confirmed, pneumonia, Germany, cases, novel coronavirus, quarantine, cases, first case |
The COVID-19 related topics in the fourth stage (March 18–April 28, 2020).
| Topic name | Rate (%) | LDA keywords | |
|---|---|---|---|
| 1 | Epidemic situation in the United States | 23.22 | The United States, coronavirus, deaths, number of people, virus, confirmed, ten thousand, infection |
| 2 | Notification of epidemic situation | 22.28 | Confirmed, newly increased, accumulation, imported, overseas, hospital discharge, medical observation, death |
| 3 | Global response to the epidemic | 16.87 | Epidemic, China, global, response, provide, economics, international, organization |
| 4 | Epidemic prevention and control | 10.11 | Novel, coronavirus, fight, prevention and control, health, detection, close, temporary |
| 5 | Epidemic situation in Japan | 8.03 | Coronavirus, Japan, novel, epidemic, affect, Olympic Games, World Health Organization |
| 6 | Viral vaccine | 6.64 | Coronavirus, vaccine, novel, research, treatment, antibody, clinic, clinical test |
| 7 | Enterprises resumed work and production | 5.80 | Epidemic, prevention and control, enterprises, resumed work, measures, aggregation, release, resumed production |
| 8 | The reinstatement of schools | 3.81 | Face masks, school reopen, school, prevention and control, students, grades, achieve, prevention |
| 9 | Medical resources | 3.24 | Britain, China, detection test kits, medical, reagent, detection, breathing machine, products |
The COVID-19 related topics in the fifth stage (April 29–May 31, 2020).
| Topic name | Rate (%) | LDA keywords | |
|---|---|---|---|
| 1 | Epidemic prevention and control | 32.74 | Epidemic, prevention and control, work, detection, achieve, normalcy, place, fever |
| 2 | Epidemic situation in the United States | 22.84 | the United States, novel coronavirus, pneumonia, death, President, time, ten thousand, epidemic |
| 3 | Notification of epidemic situation | 18.59 | Confirmed, newly increased, accumulation, imported, overseas, report, hospital discharge, asymptomatic |
| 4 | Economic impact of the epidemic | 16.64 | Epidemic, China, pneumonia, economics, affect, global, Japan, tourism |
| 5 | The students resumed their studies | 5.77 | School, students, resume classes, resume studies, back to school, prevention and control, time, school reopens |
| 6 | Epidemic situation in Brazil | 3.42 | Cases, confirmed, Brazil, death, accumulation, newly increased, ten thousands of cases, one day |
The correlation between sentiment score and public attention level.
| Variables | Minimum value | Maximum value | Mean Value | Standard deviation | Correlation coefficient |
|---|---|---|---|---|---|
| Sentiment score | 0.072 | 0.724 | 0.535 | 0.095 | 0.562 |
| Public attention level | 0 | 22,566,518 | 2,882,824.439 | 2,854,732.627 |
Number (N) 157.
p < 0.001.
Fig. 4The overall trend of sentiment analysis related to COVID-19 popular Weibo.