| Literature DB >> 36186808 |
Gayeong Eom1, Sanghyun Yun2, Haewon Byeon2,3.
Abstract
Although the full vaccination rate of South Korea compared to other countries, concerns about the effectiveness of the vaccine are growing as new COVID variants such as Alpha, Beta, Gamma, Delta, and Omicron appear over time. In this study, we collected Twitter data in South Korea that contained keywords like vaccines after the outbreak of the Omicron variant from 27 November 2021 to 14 February 2022. First, we analyzed the relationship between potential keywords associated with vaccination after the appearance of the Omicron variant in Twitter using network analysis. Second, we developed an efficient model for predicting the emotion of speech regarding vaccination after the COVID-19 Omicron variant pandemic by using deep learning algorithms. We constructed sentiment analysis models regarding vaccination after the COVID-19 Omicron pandemic by using five algorithms [i.e., support vector machine (SVM), recurrent neural networks (RNNs), long short-term memory models (LSTMs), bidirectional encoder representations from transformers (BERT), and Korean BERT (KoBERT)]. The results confirmed that KoBERT showed the best performance (71%) in all predictive performance indicators (accuracy, precision, and F1 score). It is necessary to prepare measures to alleviate the negative factorss of the public about vaccination in the long-term pandemic situation and help the public recognize the efficacy and safety of vaccination by using big data based on the results of this study.Entities:
Keywords: BERT; COVID-19 Omicron variant; NLP; deep learning; sentiment analysis
Year: 2022 PMID: 36186808 PMCID: PMC9515356 DOI: 10.3389/fmed.2022.948917
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
FIGURE 1Trend in the volume of keyword search related to the Omicron variant in South Korean news media (27 November 2021—14 February 2022).
FIGURE 2Schematic diagram of this study.
FIGURE 3RNN structure.
FIGURE 4LSTM structure.
FIGURE 5BERT pre-training and fine-tuning procedures.
Results of frequency analysis of top 30 keywords and TF-IDF.
| Keywords | Frequency | Keywords | TF-IDF | |
| 1 | Omicron | 8,279 | Inoculation | 3286.131 |
| 2 | Vaccine | 7,997 | COVID-19 | 2777.284 |
| 3 | Inoculation | 3,246 | Infection | 2610.349 |
| 4 | COVID-19 | 1,860 | Effect | 2197.277 |
| 5 | Infection | 1,508 | Vaccine booster | 1998.605 |
| 6 | Effect | 1,129 | Proof of vaccination | 1890.749 |
| 7 | Vaccine booster | 965 | Spread | 1781.616 |
| 8 | Spread | 825 | Disease control | 1757.968 |
| 9 | Proof of vaccination | 820 | Confirmed case | 1670.345 |
| 10 | Disease control | 719 | Pfizer | 1640.591 |
| 11 | Pfizer | 703 | Research | 1463.721 |
| 12 | Confirmed case | 691 | Delta | 1361.974 |
| 13 | Research | 573 | Prevention | 1284.294 |
| 14 | Delta | 507 | Cold | 1253.636 |
| 15 | Prevention | 464 | Symptom | 1151.075 |
| 16 | Cold | 421 | Immunity | 1126.171 |
| 17 | Symptom | 385 | Unvaccinated people | 1104.679 |
| 18 | Unvaccinated people | 378 | Mask | 1093.829 |
| 19 | Immunity | 367 | Government | 1077.68 |
| 20 | Government | 363 | Treatment | 1045.194 |
| 21 | Mask | 361 | Antibody | 1030.577 |
| 22 | Treatment | 331 | Confirmed case | 989.9985 |
| 23 | Antibody | 320 | Result | 989.3775 |
| 24 | Result | 320 | Severe symptom | 951.5444 |
| 25 | Confirmed case | 316 | Examination | 911.4365 |
| 26 | Severe symptom | 301 | Response | 866.7388 |
| 27 | Examination | 267 | Breakthrough infection | 859.0066 |
| 28 | Response | 261 | Death | 837.5117 |
| 29 | Breakthrough infection | 259 | Completion | 827.4268 |
| 30 | Death | 247 | Occurrence | 822.6 |
Results of LDA topic modeling.
| Topic | Topic name | Keywords |
| 1 | Omicron | COVID-19, proof of vaccination, influenza, cold, government, disease control, fear, fatality, entry, and quarantine |
| 2 | Vaccine | Effect, inoculation, Pfizer, vaccine booster, COVID-19, prevention, antibody, research, variant, and Delta |
| 3 | Vaccine Inequality | Research, worldwide, symptom, treatment, spread, COVID-19, inequality, helplessness, concern, and response |
| 4 | Breakthrough Infection | Vaccinated people, infection, confirmed case, spread, disease control, COVID-19, definite diagnosis, unvaccinated people, breakthrough infection, and outbreak |
FIGURE 6Visualization of network analysis results.
FIGURE 7Visualization of CONCOR analysis results.
Keyword factor types related to Omicron and vaccine based on CONCOR analysis.
| Cluster name | Keywords | |
| 1 | Omicron and vaccination status | Omicron, proof of vaccination, COVID-19, variant, immunity, treatment, definite diagnosis, vaccine, possibility, fatality, inequality, government, symptom, spread, cold, worldwide, present, inoculation, response, domestic, mask, Delta, side effects, and influenza |
| 2 | Infection and treatment | Medical care, quarantine, hospitalization, situation, examination, breakthrough infection, infection, risk, patient, disease control, and unvaccinated people |
| 3 | Vaccine effectiveness and the need for vaccine research | Prevention, need, consequence, death, effect, Pfizer, research, antibody, vaccine booster, and severe symptom |
| 4 | Increase in confirmed cases and deaths | Increase, occurrence, death (the death toll), confirmed case, and completion |
FIGURE 8Frequency of positive labels and negative labels.
Performance evaluation results of five sentiment classification models.
| SVM | RNN | LSTM | BERT | KoBERT | |
| Accuracy | 57.677 | 62.241 | 62.897 | 67.182 | 71.541 |
| Precision | 56.312 | 61.027 | 60.998 | 66.445 | 71.016 |
| F1-score | 54.991 | 59.971 | 60.212 | 65.612 | 70.831 |
FIGURE 9Sentiment prediction result for new speech input of the final sentiment prediction model (KoBERT).