| Literature DB >> 36011450 |
Jinhai Li1, Yunlei Ma2, Xinglong Xu3, Jiaming Pei4, Youshi He3.
Abstract
The outbreak of the coronavirus disease 2019 (COVID-19) represents an alert for epidemic prevention and control in public health. Offline anti-epidemic work is the main battlefield of epidemic prevention and control. However, online epidemic information prevention and control cannot be ignored. The aim of this study was to identify reliable information sources and false epidemic information, as well as early warnings of public opinion about epidemic information that may affect social stability and endanger the people's lives and property. Based on the analysis of health and medical big data, epidemic information screening and public opinion prevention and control research were decomposed into two modules. Eight characteristics were extracted from the four levels of coarse granularity, fine granularity, emotional tendency, and publisher behavior, and another regulatory feature was added, to build a false epidemic information identification model. Five early warning indicators of public opinion were selected from the macro level and the micro level to construct the early warning model of public opinion about epidemic information. Finally, an empirical analysis on COVID-19 information was conducted using big data analysis technology.Entities:
Keywords: COVID-19; early warning; epidemic; health and medical big data; public opinion; subject matrix; xgboost
Mesh:
Year: 2022 PMID: 36011450 PMCID: PMC9408673 DOI: 10.3390/ijerph19169819
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Identification features of false epidemic information.
| Feature Level | Feature Number | Feature Identification |
|---|---|---|
| Coarse granularity | Subject relevance | |
| Fine granularity level | Number of feature words | |
| Number of clauses | ||
| Effective length of information | ||
| Emotional orientation | Emotional intensity | |
| Publisher behavior | Publisher’s name | |
| Level of publisher | ||
| Amount of information published by the publisher | ||
| Regulatory feature | The release of historical false information by the publisher |
Figure 1The identification process of fake epidemic information based on the XGBoost algorithm.
Figure 2Implementation code of false epidemic information identification based on classifier XGBClassifier.
Figure 3The specific working process of the epidemic information public opinion early warning model.
Early warning indicator system of epidemic information public opinion.
| Indicator Hierarchy | Indicator Serial Number | Indicator Influencing Factors | Indicator Description |
|---|---|---|---|
| Macro level | Change rate of epidemic information quantity | This indicator reflects the changing trend of the amount of epidemic information per unit of time. If the change rate is positive, this indicates that the number of people paying attention to the epidemic continues to increase; if the change rate is negative, this indicates that the number of people paying attention to the epidemic continues to decrease. | |
| Regional coverage rate of epidemic information | This indicator reflects the regional distribution involved in the epidemic information. If the coverage rate is high, this indicates that the epidemic situation is a concern in many regions. | ||
| Micro level | Emotional tendency of epidemic information | This indicator reflects the public’s attitude toward the epidemic situation. If the emotional tendency continues to increase, this indicates that the public has a positive attitude toward the epidemic situation. | |
| Concentration degree of epidemic information subject | This indicator reflects the extent to which some subjects of the epidemic are a concern. If the concentration is high, this means that most people are more concerned about some aspect of the epidemic situation. | ||
| New subject regarding epidemic information | This indicator reflects a subject of epidemic information that has not been a concern before. If a new subject emerges, this means that the epidemic has changed and attracted wide public attention. |
Partial webpages of reliable epidemic data sources (accessed on 31 March 2020).
| ID | URL |
|---|---|
| 6 |
|
| 32 |
|
| … | … |
Partial text information of effective epidemic information.
| ID | URL |
|---|---|
| 4 | Will the spread of COVID-19 become “prolonged”... |
| 156 | The COVID-19 pandemic, scientists have imagined five ways it could end ... |
| … | … |
Figure 4Change rate of epidemic information quantity.
Figure 5Number of confirmed cases in the United States from 10 March to 28.
Figure 6Regional coverage of epidemic information in China.
Figure 7Public opinion indicator value at the micro level.
Figure 8The accuracy comparison of the five models.
Figure 9Execution time comparison of the five models.