| Literature DB >> 32921839 |
Huosong Xia1,2, Wuyue An1, Jiaze Li3, Zuopeng Justin Zhang4.
Abstract
Based on complex adaptive system theory and information theory for investigating heterogeneous situations, this paper develops an outlier knowledge management framework based on three aspects-dimension, object, and situation-for dealing with extreme public health events. In the context of the COVID-19 pandemic, we apply advanced natural language processing (NLP) technology to conduct data mining and feature extraction on the microblog data from the Wuhan area and the imported case province (Henan Province) during the high and median operating periods of the epidemic. Our experiment indicates that the semantic and sentiment vocabulary of words, the sentiment curve, and the portrait of patients seeking help were all heterogeneous in the context of COVID-19. We extract and acquire the outlier knowledge of COVID-19 and incorporate it into the outlier knowledge base of extreme public health events for knowledge sharing and transformation. The knowledge base serves as a think tank for public opinion guidance and platform suggestions for dealing with extreme public health events. This paper provides novel ideas and methods for outlier knowledge management in healthcare contexts.Entities:
Keywords: Analysis of public opinion; COVID-19; Governance suggestion; Natural language processing; Outlier knowledge management
Year: 2020 PMID: 32921839 PMCID: PMC7477628 DOI: 10.1016/j.seps.2020.100941
Source DB: PubMed Journal: Socioecon Plann Sci ISSN: 0038-0121 Impact factor: 4.923
Fig. 1Theoretical foundation.
Fig. 2COVID-19 public opinion analysis research framework.
Fig. 3BERT input representation.
Fig. 4BERT + LDA model framework.
Comparison of evaluation criteria of different models.
| Accuracy | Recall | Precision | |
|---|---|---|---|
| Word2vec | 0.847 | 0.86 | 0.903 |
| BERT | 0.899 | 0.9 | 0.945 |
Important feature words in blog posts with different emotional orientations.
| Negative sentiment tendency | Positive sentiment tendency | ||||||
|---|---|---|---|---|---|---|---|
| Outpatient | parents | drug | diagnosis | state department | strengthen | farewell | charter |
| incubation period | peak | panic | runaway | evacuation | donation | hardcore | Salute |
| lockdown | wildlife | College Entrance Examination | closed | immunity | dream | go to | warrior |
| discrimination | newly increased | progress | help | encourage | no new | retrograde | dedication |
| start schoo | isolation | online courses | online | work together | hope | sunshine | normal |
| stagnation | resources | close | claim | cure | hold on | uphold | stick to |
| difficult | fever | difficulty breathing | mask | through | help | control | inspect |
| incubation | cycle | layoffs | rumors | free access | spend | movement | transparent |
| Ignore | reservation | sleepy | fire | decline | support | evacuate | donate |
| shutdown | home quarantine | serious | deficient | prevention | Mourn | silent | role model |
| Contact | track events | risk level | for long | volunteer | closed city | technology | back to school |
KL values of different models (Wuhan citizens' microblog).
| High operating period | Median operating period | |||||||
|---|---|---|---|---|---|---|---|---|
| Topics number | 3 | 4 | 5 | 6 | 3 | 4 | 5 | 6 |
| BERT-LDA | 8.71 | 8.16 | 7.03 | 8.63 | 8.08 | 7.15 | ||
| Lda2vec | 8.96 | 8.34 | 7.95 | 7.48 | 8.15 | 8.52 | 7.86 | 7.24 |
Topic distribution of Wuhan citizens' microblog.
| Topic | High operating period | Median operating period | |||||
|---|---|---|---|---|---|---|---|
| Material demand | Daily life | Epidemic protection | Material demand | Daily life | Epidemic protection | Emotion | |
| takeout, supplies, vegetable, | go out, | epidemic situation, | food, | go out, | epidemic situation, masks, diagnosis, hospitals, COVID-19, disinfection, medical staff, temperature taking, prevention and control, epidemic areas | Boring, hard to buy, happy, like, ok, | |
Average KL distance between any two topics (Provinces of imported cases).
| High operating period | Median operating period | |||||||
|---|---|---|---|---|---|---|---|---|
| Topics number | 3 | 4 | 5 | 6 | 3 | 4 | 5 | 6 |
| BERT-LDA | 9.21 | 9.63 | 8.76 | 12.56 | 12.00 | 11.25 | ||
| Lda2vec | 8.63 | 9.77 | 8.76 | 7.94 | 11.86 | 12.03 | 11.78 | 10.98 |
Topic distribution of citizens' microblog (Provinces of imported cases).
| Topic | High operating period | Median operating period | ||||||
|---|---|---|---|---|---|---|---|---|
| Returning to work and school | Food and entertainment | Epidemic prevention | Emotional attitude | Returning to work and school | Epidemic prevention | Emotional attitude | Closed city | |
| Key words | school, | Hotpot, | epidemic situation, prevention and control, medical treatment, masks, anti-epidemic, | sunny, spring, bored, scared, painful, lonely, beautiful, happy, cute, happy | health certificate, certificate, processing, office, | Zero growth, improvement, isolation, cure, | steady, | fully closed, advised to return, unable to enter, cleared, opened to traffic, returned to normal |
Fig. 5Sentiments tendency.
Fig. 6Visualization of the portrait of patients.