| Literature DB >> 34988056 |
Quan Xiao1, Weiling Huang1, Xing Zhang2, Shanshan Wan1, Xia Li1.
Abstract
The capturing of social opinions, especially rumors, is a crucial issue in digital public health. With the outbreak of the COVID-19 pandemic, the discussions of related topics have increased exponentially in social media, with a large number of rumors on the Internet, which highly impede the harmony and sustainable development of society. As human health has never suffered a threat of this magnitude since the Internet era, past studies have lacked in-depth analysis of rumors regarding such a globally sweeping pandemic. This text-based analysis explores the dynamic features of Internet rumors during the COVID-19 pandemic considering the progress of the pandemic as time-series. Specifically, a Latent Dirichlet Allocation (LDA) model is used to extract rumor topics that spread widely during the pandemic, and the extracted six rumor topics, i.e., "Human Immunity," "Technology R&D," "Virus Protection," "People's Livelihood," "Virus Spreading," and "Psychosomatic Health" are found to show a certain degree of concentrated distribution at different stages of the pandemic. Linguistic Inquiry and Word Count (LIWC) is used to statistically test the psychosocial dynamics reflected in the rumor texts, and the results show differences in psychosocial characteristics of rumors at different stages of the pandemic progression. There are also differences in the indicators of psychosocial characteristics between truth and disinformation. Our results reveal which topics of rumors and which psychosocial characteristics are more likely to spread at each stage of progress of the pandemic. The findings contribute to a comprehensive understanding of the changing public opinions and psychological dynamics during the pandemic, and also provide reference for public opinion responses to major public health emergencies that may arise in the future.Entities:
Keywords: COVID-19 pandemic; internet rumors; latent dirichlet allocation; linguistic inquiry and word count; public psychologies; topic modeling
Mesh:
Year: 2021 PMID: 34988056 PMCID: PMC8722471 DOI: 10.3389/fpubh.2021.788848
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1Research framework.
Figure 2Word2Vec model.
Figure 3The LDA model.
Figure 4Calculation of perplexity.
Figure 5Calculation of coherence.
Figure 6Visualization of topics.
Extracted topics, keywords and examples of rumor texts.
|
|
|
|
|
|---|---|---|---|
| Topic1 | Human immunity | Antibodies, Body, Cell, Immunity, | “Bird's nest can boost immune system and prevent new coronavirus,” “Resistance can be improved by heavy exercise during the pandemic” |
| Topic2/4 | Technology R&D | Chloroquine, Zhong Nanshan, Gene, Academician, Italy, USA, Sequence, Japan | “Resveratrol may treat and prevent novel coronavirus,” “India has developed a nano spray, a spray object surface 90 days free of viruses” |
| Topic3 | Virus protection | USA, Influenza, Filtration, Protection, Ultraviolet light, Wearing, Materials, Medical | “Do not wear sweaters or clothing jackets with fur collars or fleece, they tend to attract viruses,” “Flower lotion with 70–75% alcohol content can effectively prevent the new coronavirus” |
| Topic5 | People's livelihood | Start of the new term, school, grade, internet, refuting rumors, official, media, social | “During the pandemic, you cannot go out for a walk, as it is easy to be infected,” “2020 National College Entrance Examination postponed for one month” |
| Topic6 | Virus spreading | Exposure, patients, droplets, disease, seafood, south in china, crowd, air | “Pangolin as an intermediate host for the novel coronavirus,” “Willow flock can carry new coronaviruses, leading to trans-regional spreading” |
| Topic7 | Psychosomatic health | Isolation, high blood pressure, blood vessels, heart, antihypertensive drugs, exercise, injury, nervous | “Antihypertensive drugs increase the risk of viral infections in patients with high blood pressure,” “Children wearing N95 masks may cause irreversible damage” |
Figure 7Data on progress of the COVID-19 pandemic in China.
Figure 8Dynamics in the number of rumors on different topics.
T-test results for truth and misinformation.
|
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| ||||
| Psychological characteristics | Social | −3.210 | 0.001 | −0.011 | Linguistic characteristics | Verb | 2.845 | 0.005 | 0.014 |
| Friend | −2.138 | 0.033 | −0.001 | AuxVerb | 3.384 | 0.001 | 0.010 | ||
| Affect | 2.418 | 0.016 | 0.008 | Number | −3.316 | 0.001 | −0.005 | ||
| CogMech | 3.804 | 0.000 | 0.024 | Interjunction | 2.282 | 0.023 | 0.008 | ||
| Cause | 2.524 | 0.012 | 0.005 | TenseM | −2.804 | 0.005 | −0.007 | ||
| Discrep | 3.044 | 0.002 | 0.009 | ProgM | −2.753 | 0.006 | −0.004 | ||
| Tentat | 2.203 | 0.028 | 0.005 | Personal | Relative | −2.977 | 0.003 | −0.020 | |
| Inclusive | 3.436 | 0.001 | 0.009 | Space | −4.085 | 0.000 | −0.023 | ||
| See | −2.183 | 0.029 | −0.002 | Leisure | −4.559 | 0.000 | −0.012 | ||
| Bio | 2.885 | 0.004 | 0.019 | General Description | Qmark | 5.778 | 0.000 | 0.007 | |
| Health | 5.764 | 0.000 | 0.030 | WordPerSentence | −2.347 | 0.019 | −1.719 | ||
| Ingest | −3.303 | 0.001 | −0.011 | RateSixLtrWord | 2.360 | 0.019 | 0.002 | ||
MD stands for Mean-difference, where a positive value indicates that the characteristics in truth has a higher value than in misinformation, while a negative value indicates the opposite. Considering the limited space, insignificant characteristics are not listed. The meanings of each characteristics can be referred to Tausczik and Pennebaker (.
Full name, category and word examples of the identified characteristics.
|
|
|
|
|
|---|---|---|---|
| Affect | Affective process | Psychological characteristics | happy, cried, abandon |
| AuxVerb | Auxiliary verbs | Linguistic characteristics | am, will, have |
| Bio | Biological process | Psychological characteristics | eat, blood, pain |
| Cause | Causation | Psychological characteristics | because, effect, hence |
| CogMech | Cognitive process | Psychological characteristics | cause, know, ought |
| Discrep | Discrepancy | Psychological characteristics | should, would, could |
| Friend | Friend | Psychological characteristics | buddy, friend, neighbor |
| Health | Health | Psychological characteristics | clinic, flu, pill |
| Inclusive | Inclusive | Psychological characteristics | and, with, include |
| Ingest | Ingestion | Psychological characteristics | dish, eat, pizza |
| Insight | Insight | Psychological characteristics | think, know, consider |
| Interjunction | Interjunction | Linguistic characteristics | and, but, whereas |
| Leisure | Leisure | Personal | cook, chat, movie, apartment, kitchen, |
| Number | Number | Linguistic characteristics | second, thousand |
| ProgM | Progressive marks | Linguistic characteristics | —— |
| Qmark | Question marks | General description | —— |
| RateSixLtrWord | Words>6 letters | General description | —— |
| Relative | Relativity | Personal | area, bend, exit, stop |
| Religion | Religion | Personal | altar, church, mosque |
| See | See | Psychological characteristics | view, saw, seen |
| Social | Social process | Psychological characteristics | mate, talk, they, child |
| Space | Space | Personal | down, in, thin |
| TenseM | Tense marks | Linguistic characteristics | —— |
| Tentat | Tentative | Psychological characteristics | maybe, perhaps, guess |
| Time | Time | Personal | end, until, season |
| Verb | Common verbs | Linguistic characteristics | walk, went, see |
| WordPerSentence | Words per sentence | General Description | —— |
| Work | Work | Personal | job, majors, xerox |
F-test results for rumors across different stages.
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| CogMech | 4 | 1 | −0.030 | 0.000 | Health | 3 | 1 | −0.019 | 0.012 |
| 2 | −0.041 | 0.000 | 4 | 0.017 | 0.033 | ||||
| 3 | −0.025 | 0.010 | 4 | 1 | −0.035 | 0.000 | |||
| 5 | −0.031 | 0.002 | 2 | −0.028 | 0.000 | ||||
| Insight | 5 | 1 | 0.010 | 0.000 | 3 | −0.017 | 0.033 | ||
| 2 | 0.009 | 0.003 | 5 | 1 | −0.033 | 0.000 | |||
| 3 | 0.009 | 0.005 | 2 | −0.026 | 0.002 | ||||
| 4 | 0.008 | 0.012 | Space | 2 | 4 | −0.022 | 0.005 | ||
| Discrep | 1 | 2 | −0.006 | 0.100 | 5 | −0.036 | 0.000 | ||
| 3 | 0.010 | 0.015 | 5 | 1 | 0.023 | 0.010 | |||
| 4 | 0.019 | 0.000 | 2 | 0.036 | 0.000 | ||||
| 5 | 0.015 | 0.001 | Time | 4 | 1 | 0.017 | 0.000 | ||
| 2 | 1 | 0.006 | 0.100 | 5 | 1 | 0.022 | 0.000 | ||
| 3 | 0.016 | 0.000 | 2 | 0.014 | 0.011 | ||||
| 4 | 0.025 | 0.000 | 3 | 0.014 | 0.027 | ||||
| 5 | 0.021 | 0.000 | Work | 1 | 3 | −0.014 | 0.049 | ||
| 3 | 1 | −0.010 | 0.015 | 4 | −0.045 | 0.000 | |||
| 2 | −0.016 | 0.000 | 5 | −0.017 | 0.026 | ||||
| 4 | 0.009 | 0.033 | 4 | 1 | 0.045 | 0.000 | |||
| Bio | 3 | 1 | −0.025 | 0.010 | 2 | 0.035 | 0.000 | ||
| 2 | −0.020 | 0.047 | 3 | 0.031 | 0.000 | ||||
| 4 | 0.027 | 0.007 | 5 | 0.028 | 0.000 | ||||
| 4 | 1 | −0.052 | 0.000 | Religion | 5 | 1 | 0.005 | 0.000 | |
| 2 | −0.047 | 0.000 | 2 | 0.006 | 0.000 | ||||
| 3 | −0.027 | 0.007 | 3 | 0.006 | 0.000 | ||||
| 5 | 1 | −0.044 | 0.000 | 4 | 0.005 | 0.000 | |||
| 2 | −0.039 | 0.000 |
I and J represent the two stages compared, where 1 for Initial stage (Jan 18–Feb 5), 2-Outbreak stage (Feb 6–Feb 21), 3-Plateau stage (Feb 22–Mar 8), 4-Recession stage (Mar 9-May7), 5-Regular Control stage (May 8–Oct 2). MD(I-J) is the mean-difference between I and J phases, where a positive value means the value of I stage is larger than J stage, and a negative value is the opposite.