| Literature DB >> 35016072 |
K Hayawi1, S Shahriar2, M A Serhani3, I Taleb2, S S Mathew2.
Abstract
OBJECTIVES: COVID-19 (SARS-CoV-2) pandemic has infected hundreds of millions and inflicted millions of deaths around the globe. Fortunately, the introduction of COVID-19 vaccines provided a glimmer of hope and a pathway to recovery. However, owing to misinformation being spread on social media and other platforms, there has been a rise in vaccine hesitancy which can lead to a negative impact on vaccine uptake in the population. The goal of this research is to introduce a novel machine learning-based COVID-19 vaccine misinformation detection framework. STUDYEntities:
Keywords: COVID-19; Deep learning; Misinformation detection; Natural language processing; Text classification; Vaccines
Mesh:
Substances:
Year: 2021 PMID: 35016072 PMCID: PMC8648668 DOI: 10.1016/j.puhe.2021.11.022
Source DB: PubMed Journal: Public Health ISSN: 0033-3506 Impact factor: 2.427
Existing works in COVID-19 misinformation and COVID-19 vaccine tweets.
| Source | Application | Dataset | Available online | Prediction results |
|---|---|---|---|---|
| Misinformation dataset, analysis, and classification | Social media and website misinformation regarding COVID-19 | ✓ | F1-score: 0.58 using hierarchical attention network–based model | |
| Misinformation dataset and analysis | Annotated COVID-19 misinformation tweets | ✓ | N/A | |
| Reliable and unreliable news dataset, analysis, and prediction | News articles and their credibility level as well as tweets related to their spread | ✓ | F1-scores: 0.83 and 0.67 for reliable and unreliable news detection, respectively, using neural networks | |
| Large COVID-19 tweets dataset, analysis, and classification | Tweets related to COVID-19 in more than 100 languages from 268 countries | ✓ | F1-score: 0.98 for COVID-relevant tweets using the transformer-based masked language model | |
| COVID-19 misinformation detection | Combination of various existing tweets datasets related to COVID-19, disasters, news, and gossip | ✕ | F1-score: 0.985 using LSTM | |
| COVID-19 misinformation detection in Arabic | Arabic tweets related to COVID-19 | ✓ | F1-score: 0.74 using MARABERT | |
| COVID-19 misinformation detection in English and Arabic | English and Arabic tweets related to COVID-19 | ✓ | Not presented | |
| COVID-19 fake news detection in Chinese | Chinese microblog posts from Weibo | ✓ | F1-score: 0.94 using TextCNN | |
| COVID-19 anti-vaccine tweets dataset and analysis | Tweets exhibiting anti-vaccine stance collected using keywords | ✓ | N/A | |
| COVID-19 anti-vaccine tweets analysis | COVID-19 vaccine tweets collected using keywords | ✕ | N/A | |
| COVID-19 vaccine tweets analysis | COVID-19 vaccine tweets collected using keywords | ✓ | N/A | |
| COVID-19 vaccine tweets sentiment analysis | COVID-19 vaccine tweets collected using keywords | ✓ | N/A | |
| COVID-19 vaccine tweets sentiment analysis | COVID-19 vaccine tweets collected using keywords | ✕ | N/A |
Fig. 1Number of vaccine tweets by month.
Fig. 2Word cloud visualization for vaccine misinformation tweets.
Fig. 3Word cloud visualization for general vaccine-related tweets.
Fig. 4Research framework. tf-idf, term frequency-inverse document frequency.
Fig. 5Confusion matrix on the test set using XGBoost.
Fig. 6Training and validation accuracies using LSTM.
Fig. 7Confusion matrix on the test set using LSTM.
Fig. 8Training and validation accuracies using BERT.
Fig. 9Confusion matrix on the test set using BERT.
Performance comparison with related COVID-19 misinformation detection works.
| Source | Classification | F1-score |
|---|---|---|
| COVID-19 misinformation | 0.58 | |
| COVID-19 news reliability detection | 0.83 and 0.67 | |
| COVID-19 misinformation | 0.92 | |
| COVID-19 misinformation | 0.985 | |
| Arabic COVID-19 misinformation | 0.74 | |
| Chinese COVID-19 misinformation | 0.94 | |