| Literature DB >> 35722449 |
Md Sabab Zulfiker1, Nasrin Kabir2, Al Amin Biswas1, Sunjare Zulfiker3, Mohammad Shorif Uddin2.
Abstract
Since December 2019, the world has been fighting against the COVID-19 pandemic. This epidemic has revealed a bitter truth that though humans have advanced to unprecedented heights in the last few decades in terms of technology, they are lagging far behind in the fields of medical science and health care. Several institutes and research organizations have stepped up to introduce different vaccines to combat the pandemic. Bangladesh government has also taken steps to provide widespread vaccinations from January 2021. The Bangladeshi netizens are frequently sharing their thoughts, emotions, and experiences about the COVID-19 vaccines and the vaccination process on different social media sites like Facebook, Twitter, etc. This study has analyzed the views and opinions that they have expressed on different social media platforms about the vaccines and the ongoing vaccination program. For performing this study, the reactions of the Bangladeshi netizens on social media have been collected. The Latent Dirichlet Allocation (LDA) model has been used to extract the most common topics expressed by the netizens regarding the vaccines and vaccination process in Bangladesh. Finally, this study has applied different deep learning as well as traditional machine learning algorithms to identify the sentiments and polarity of the opinions of the netizens. The performance of these models has been assessed using a variety of metrics such as accuracy, precision, sensitivity, specificity, and F1-score to identify the best one. Sentiment analysis lessons from these opinions can help the government to prepare itself for the future pandemic.Entities:
Keywords: COVID-19; Deep learning; Public sentiment; Social media; Traditional machine learning; Vaccination
Year: 2022 PMID: 35722449 PMCID: PMC9188682 DOI: 10.1016/j.array.2022.100204
Source DB: PubMed Journal: Array (N Y) ISSN: 2590-0056
Fig. 1Monthly percentage of the collected opinions.
Fig. 2Percentage of the collected opinions by account type.
Inter-rater reliability test scores.
| Test Name | Reliability Score | 95% Confidence Interval (95% CI) | |
|---|---|---|---|
| Lower Bound | Upper Bound | ||
| Krippendorff's Alpha Reliability Test | 0.8473 | 0.8405 | 0.8539 |
| Fleiss' Kappa Reliability Test | 0.8470 | 0.8340 | 0.8600 |
Fig. 3Monthly percentage of the polarity of the opinions.
Fig. 4Sentiment dynamics of every ten (10) days.
Fig. 5Ten (10) day moving average of the positive and negative opinions.
Fig. 6Word clouds.
Perplexity scores for different number of topics in the corpus of the positive and negative statements.
| Number of Topics | Perplexity Score for Corpus of the Positive Statements | Perplexity Score for Corpus of the Negative Statements |
|---|---|---|
| 2 | −6.04 | −6.11 |
| 3 | −6.10 | −6.21 |
| 4 | −6.20 | −6.30 |
| 5 | −6.26 | −6.36 |
Top five topics of the positive corpus.
| Topic 1 | Topic 2 | Topic 3 | Topic 4 | Topic 5 |
|---|---|---|---|---|
| vaccines | Vaccine | vaccine | no | vaccine |
| bangladesh | Bangladesh | good | side | bangladesh |
| vaccine | Vaccines | news | effects | dose |
| best | doses | us | everyone | great |
| minister | china | people | alhamdulillah | got |
| hope | good | bangladesh | took | vaccinated |
| bangavax | million | country | vaccine | vaccination |
| great | covid19 | biotech | get | pfizer |
| see | sinopharm | also | medical | second |
| taking | moderna | globe | parents | thank |
| us | government | government | bangladesh | doses |
| initiative | covid | thanks | fine | news |
| appreciate | us | allah | vaccinated | people |
| countries | pfizer | need | know | first |
| congratulations | covax | get | vaccination | soon |
Top five topics of the negative corpus.
| Topic 1 | Topic 2 | Topic 3 | Topic 4 | Topic 5 |
|---|---|---|---|---|
| vaccine | vaccines | vaccine | not | vaccine |
| people | bangladesh | not | vaccine | bangladesh |
| bangladesh | china | take | vaccines | vaccines |
| not | no | people | people | not |
| dose | vaccine | business | vaccination | get |
| india | price | dose | fever | money |
| first | get | indian | still | indian |
| corona | india | need | making | india |
| got | minister | india | price | government |
| urine | trial | taking | body | govt |
| cow | know | covid | u | us |
| astrazeneca | people | bangladesh | chinese | time |
| even | politics | no | pain | people |
| vaccination | sinopharm | wo | headache | countries |
| us | vaccinated | serum | countries | give |
Fig. 7Implementation procedures for predicting the polarity of the statements.
Fig. 8Architecture of the proposed deep learning models.
Confusion matrices of the proposed deep learning models.
| Model | Pre-trained Word Embedding | TPPolarity | TNPolarity | FPPolarity | FNPolarity |
|---|---|---|---|---|---|
| LSTM | word2vec | 112 | 73 | 20 | 10 |
| GloVe | 108 | 75 | 18 | 14 | |
| Bi LSTM | word2vec | 108 | 80 | 13 | 14 |
| GloVe | 113 | 71 | 22 | 9 | |
| 1D-CNN | word2vec | 109 | 74 | 19 | 13 |
| GloVe | 114 | 67 | 26 | 8 | |
| TCN | word2vec | 108 | 78 | 15 | 14 |
| GloVe | 105 | 79 | 14 | 17 |
Confusion matrices of the proposed traditional machine learning models.
| Model | TPPolarity | TNPolarity | FPPolarity | FNPolarity |
|---|---|---|---|---|
| DT | 75 | 37 | 56 | 47 |
| GB | 89 | 41 | 52 | 33 |
| SVM | 97 | 40 | 53 | 25 |
Performance metrics of the proposed deep learning models.
| Model | Pre-trained Word Embedding | Accuracy | Precision | Sensitivity | Specificity | F1- score |
|---|---|---|---|---|---|---|
| LSTM | word2vec | 86.05% | 84.85% | 91.80% | 78.49% | 88.19% |
| GloVe | 85.12% | 85.71% | 88.52% | 80.65% | 87.10% | |
| Bi LSTM | word2vec | 87.44% | 89.26% | 88.52% | 86.02% | 88.89% |
| GloVe | 85.58% | 83.70% | 92.62% | 76.34% | 87.94% | |
| 1D-CNN | word2vec | 85.12% | 85.16% | 89.34% | 79.57% | 87.20% |
| GloVe | 84.19% | 81.43% | 93.44% | 72.04% | 87.02% | |
| TCN | word2vec | 86.51% | 87.80% | 88.52% | 83.87% | 88.16% |
| GloVe | 85.58% | 88.24% | 86.07% | 84.95% | 87.14% |
Performance metrics of the proposed traditional machine learning models.
| Model | Accuracy | Precision | Sensitivity | Specificity | F1- score |
|---|---|---|---|---|---|
| DT | 52.09% | 57.25% | 61.48% | 39.78% | 59.29% |
| GB | 60.47% | 63.12% | 72.95% | 44.09% | 67.68% |
| SVM | 63.72% | 64.67% | 79.51% | 43.01% | 71.32% |
Fig. 9ROC curves for deep learning models.
Fig. 10ROC curves for traditional machine learning models.