| Literature DB >> 36159189 |
Bhavani Devi Ravichandran1, Pantea Keikhosrokiani1.
Abstract
The spread of Covid-19 misinformation on social media had significant real-world consequences, and it raised fears among internet users since the pandemic has begun. Researchers from all over the world have shown an interest in developing deception classification methods to reduce the issue. Despite numerous obstacles that can thwart the efforts, the researchers aim to create an automated, stable, accurate, and effective mechanism for misinformation classification. In this paper, a systematic literature review is conducted to analyse the state-of-the-art related to the classification of misinformation on social media. IEEE Xplore, SpringerLink, ScienceDirect, Scopus, Taylor & Francis, Wiley, Google Scholar are used as databases to find relevant papers since 2018-2021. Firstly, the study begins by reviewing the history of the issues surrounding Covid-19 misinformation and its effects on social media users. Secondly, various neuro-fuzzy and neural network classification methods are identified. Thirdly, the strength, limitations, and challenges of neuro-fuzzy and neural network approaches are verified for the classification misinformation specially in case of Covid-19. Finally, the most efficient hybrid method of neuro-fuzzy and neural networks in terms of performance accuracy is discovered. This study is wrapped up by suggesting a hybrid ANFIS-DNN model for improving Covid-19 misinformation classification. The results of this study can be served as a roadmap for future research on misinformation classification.Entities:
Keywords: ANFIS; Covid-19; DNN; Misinformation classification; Neural network; Neuro-fuzzy
Year: 2022 PMID: 36159189 PMCID: PMC9488884 DOI: 10.1007/s00521-022-07797-y
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.102
Fig. 1ANFIS architecture
Fig. 2A DNN with multiple hidden layers
Summary of reviewed papers
| Study | Year | Problem tackled | Method | Dataset | Database/Library | |||
|---|---|---|---|---|---|---|---|---|
| NF | NN | NLP | Machine learning | |||||
| [ | 2018 | Classification | ✓ | X | X | X | Medical dataset | Taylor & Francis |
| [ | 2018 | Classification | ✓ | X | X | X | Brain images | ScienceDirect |
| [ | 2018 | Detection | X | ✓ | X | X | Google Scholar (Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics) | |
| [ | 2018 | Classification | X | ✓ | ✓ | X | Liar, Kaggle | Wiley |
| [ | 2018 | Classification | ✓ | X | X | X | SpringerLink | |
| [ | 2019 | Classification | ✓ | X | X | X | – | ScienceDirect |
| [ | 2019 | Identification & Classification | X | ✓ | X | DSSM | LIAR | Taylor & Francis |
| [ | 2019 | Classification | X | ✓ | X | X | Sentimental Text | IEEE |
| [ | 2019 | Classification | ✓ | X | X | X | Google Scholar (International Journal of Intelligent Engineering & Systems) | |
| [ | 2019 | Classification | X | X | X | SVM, KNN, Naïve Bayes, AdaBoost, Bagging | Taylor & Francis | |
| [ | 2019 | Classification | ✓ | X | X | X | Social Media | IEEE |
| [ | 2020 | Classification | ✓ | X | X | X | Power transformers faults | ScienceDirect |
| 2020 | Detection | |||||||
| [ | 2020 | Detection | X | ✓ | X | X | Kaggle | ScienceDirect |
| [ | 2020 | Classification | ✓ | X | X | X | – | Google Scholar ( |
| [ | 2020 | Classification | ✓ | ✓ | X | X | – | IEEE |
| [ | 2020 | Detection | X | X | X | Stochastic gradient descent | Data set of Bengali language | IEEE |
| [ | 2020 | Detection | X | X | X | 23 Machine Languages | BuzzFeed, Political News, ISOT Fake news | ScienceDirect |
| [ | 2020 | Detection | X | ✓ | X | X | Facebook, Twitter, Weibo | SpringerLink |
| [ | 2020 | Prediction | ✓ | X | X | X | – | IEEE |
| [ | 2020 | Classification | ✓ | X | X | X | Medical dataset | IEEE |
| [ | 2020 | Detection | X | X | X | DT, kNN, LR, LSVM, MNB, BNB, NN, ERF, and XGBoost | IEEE | |
| [ | 2020 | Detection & Prediction | X | X | X | Logistic Regression, Support Vector Classification, and Naïve Bayes | Scopus | |
| [ | 2021 | Verification | X | X | X | ML | Twitter, Reddit, Bing | ScienceDirect |
| [ | 2021 | Detection | X | X | ✓ | X | CNN, WHO, CDCP | ScienceDirect |
| [ | 2021 | Classification | X | X | X | Random Forest | Scopus | |
| [ | 2021 | Detection | X | X | X | ML procedures | – | IEEE |
| [ | 2021 | Detection | X | ✓ | X | SVM | FakeDataNews | Wiley |
| [ | 2021 | Classification | X | ✓ | X | SVM | Mendeley Data | ScienceDirect |
| [ | 2021 | Detection | X | X | ✓ | X | Thai Text | Scopus |
| [ | 2021 | Classification | ✓ | X | X | X | Medical dataset | Google Scholar |
| [ | 2021 | Classification | X | ✓ | X | X | Academic dataset | ScienceDirect |
| [ | 2021 | Classification | ✓ | X | X | X | Medical dataset | Wiley |
| [ | 2021 | Detection & Prediction | X | ✓ | ✓ | X | Scopus | |
| [ | 2021 | Detection | X | ✓ | X | X | Scopus | |
Keyword and synonyms
| Keyword | Synonyms |
|---|---|
| Covid-19 | Corona, Coronavirus, Covid, Covid-2019, Novel Coronavirus Illness, Wuhan coronavirus, Coronavirus diseases |
| Misinformation | Disinformation, False News, Rumours, False Rumour, False Information, Untruth, Falsity, Misreport, Misstatement, Deception |
| Social Media | Online, Social Platform, Social Site, Social Web, Multimedia, Media, Media Platform, Public Network |
| Artificial Intelligent | Neuro-fuzzy, Neural Network, Adaptive Neuro-based Fuzzy Inference System, ANFIS, Deep Neural Network, DNN |
| Detection | Observation, Identification, Spotting, Recognition, Diagnosis, Sensing |
| Classification | Categorization, Grouping, Grading, Ranking, Sorting, Systematization |
| Prediction | Forecasting, Divination, Augury, Projection, Prognosis, Guess |
Search strategy decisions
| Searched library | IEEE Xplore, SpringerLink, ScienceDirect, Scopus, Taylor & Francis, Wiley, Google Scholar |
| Searched papers | Journal papers, conference papers |
| Search applied on | Available full-text articles Attempt to avoid excluding articles that lack keywords in their title or abstract but are still applicable to the literature |
| Publication period | Since June 2018 |
Search limits on searched databases
| Library | Limits | Returned papers |
|---|---|---|
| IEEE | 2018–2021, English, Conference, Journals, Keywords | 15 |
| Science Direct | 2018–2021, English, Conference, Journals, Computer Science | 43 |
| Scopus | 2018–2021, English, Conference, Journals, Computer Science, Keywords | 22 |
| Spring | 2018–2021, English, Conference, Journals, Keywords | 86 |
| Taylor & Francis | 2018–2021, English, Conference, Journals, Computer Science, Keywords | 17 |
| Wiley | 2018–2021, English, Conference, Journals, Computer Science, Keywords | 30 |
| Google Scholar | 2018–2021, English, Conference, Journals, Keywords | 196 |
| Total | 409 | |
Fig. 3(%) Distribution of the papers by research library
Fig. 4Number of researched papers over the years
Exclusion and inclusion criteria
| Exclusion criteria | Inclusion criteria |
|---|---|
| The article is written in a language other than English | Papers that are written in the English language |
| The complete article or journal is unavailable | Paper is completely available |
| The paper has little to do with Deep Learning and detecting, classifying, or Predicting misinformation | The paper is related to machine learning and detecting, classifying, or Predicting misinformation |
| Later than 2018 | The most detailed and frequent repeated articles of the same sample were included |
Fig. 5(%) Distribution of Covid-19 misinformation approaches
Fig. 6(%) Distribution of reviewed papers covered based on techniques
Fig. 7Techniques used for misinformation classification over the last 4 years
Performance evaluation of neural network method
| Study | Problem tackled | Method | Performance | Description |
|---|---|---|---|---|
| [ | Classification | DNN + NLP | Accuracy: 81% | Combination model of DNN + NLP to identify fake or genuine information |
| [ | Detection | RvNN | Accuracy: 0.737 | Improve fake news detection based on bottom-up and top-down structured neural network |
| [ | Detection | GNN + NLP | ROC: 0.95% | Improves model with implementation of GNN |
| [ | Detection | Deep CNN (FNDNet) | Accuracy: 98.36 | The CNN-based model improves existing fake news detection |
| [ | Classification | DNN | Accuracy: 84.37 | Compares sentimental text classification of DNN and CNN models |
| [ | Detection | DT, kNN, LR, LSVM, MNB, BNB, NN, ERF, and XGBoost | Accuracy: 99.63 | Compares misleading information on Covid-19 using stated methods and NN provides the best accuracy |
| [ | Detection | CNN + BERT | F-Score: 68.24 | Covid-19 detection system using CNN and BERT |
| [ | Classification | DNN, ANN | Accuracy: 95.84 | DNN classification model outperformance ANN model and achieves better accuracy |
| [ | Classification | DNN, LSTM, BI-LSTM, GRU, BI-GRU, ID-CNN, SVM, Naïve Bayes | Accuracy: 97.900 | Predicts the validity of news and 1D-CNN outperforms |
| [ | Detection | Cross-SEAN | Accuracy: 94 | Covid-19 datasets with labelled true or false tweets |
Performance evaluation of NLP and ML methods
| Study | Problem tackled | Method | Performance | Description |
|---|---|---|---|---|
| [ | Detection | Evidence-based | Accuracy: 50.91% | Introduction of a publicly available dataset for verification |
| [ | Detection | Stochastic Gradient Descent | Accuracy: 87% | Fake news detection system based on news headlines |
| [ | Detection | Supervised ML Techniques | DT: Acc:0.968 Precision: 0.963 Recall: 0.973 F-M: 0.968 | 23 ML algorithms implemented to detection fake news and Decision Tree provides the best performance |
| [ | Detection | Logistic Regression, Support Vector Classification, and Naïve Bayes | LR: Accuracy: 84% | Covid-19 rumours detection and logistic regression provides the best accuracy |
| [ | Detection | DistilBERT | Accuracy: 97.2 | Covid-19 misinformation detection using NLP and explains why the news is false |
| [ | Classification | SVM, KNN, Naïve Bayes, AdaBoost, Bagging | Accuracy: 89.01 | Classifies Tweets into three categories based on sentiments |
| [ | Classification | Random Forest | F1 scores between 0.347 to 0.857 | Classifies four Covid-19 conspiracy theories |
| [ | Detection | LSTM, GRU, Decision Tree (DT), Logistic Regression (LR), K Nearest Neighbour (KNN), Random Forest (RF), Support Vector Machine (SVM), and Naive Bayes (NB) | GRU: Accuracy: 98.29 | provides a framework to detect Covid-19 misinformation using modified LSTM + GRU |
Performance evaluation of neuro-fuzzy method
| Study | Problem tackled | Method | Performance | Description |
|---|---|---|---|---|
| [ | Classification | ANFIS | Accuracy: 97.9, MSE: 3.188 | ANFIS is used to decrease error rate, give high precision and simplicity |
| [ | Classification | ANFIS | Accuracy: 92.16 | Implements ANFIS method for text classification |
| [ | Classification | ANFIS + Nonlinear SVM | Accuracy: 90 | Proposed a method of Classification of the political tweets using ANFIS and nonlinear SVM |
| [ | Classification | ANFIS | Accuracy: 99.30 Specificity: 99.71 Sensitivity: 70.25 Precision: 82.09 | ANFIS model outperforms CNN, Deep CNN, DGAM classification |
| [ | Classification | ANFIS | Accuracy: 99.62 | ANFIS improves robustness and increases classification accuracy |
| [ | Classification | Neuro-Fuzzy | Accuracy: 95.59 Precision: 0.9629 Recall: 0.9544 F-M: 0.9569 | NF-FR handles imprecise and problems with uncertainty |
| [ | Classification | ANFIS + DNN | Accuracy: 97.99 MSE: 0.0401 | ANFIS solves DNN problem of transparency |
| [ | Prediction | ANFIS + PSOGWO | Accuracy: 87.5 | Hybrid model of ANFIS and PSOGWO produces better outcomes |
| [ | Classification | ANFIS | Accuracy: 99.6 Specificity: 99.7 Sensitivity: 98.1 Precision: 98.5 F-M: 97.9 | Applied ANFIS in tumour classification |
| [ | Classification | ANFIS | Accuracy: 96 Specificity: 94 Sensitivity: 99 | ANFIS outperforms ANN, FIS in classifying breast ultrasound images |
| [ | Classification | ANFIS | Accuracy: 99.4 Specificity: 99.7 Sensitivity: 99.7 | Implementation of ANFIS model can improve the classification |
Fig. 8Accuracy as performance measure of different techniques
Strength and limitation of ANFIS model
| Strength | Limitation |
|---|---|
| Results precise output | High computational cost |
| High accuracy than other Neuro-Fuzzy models | A significant bottleneck to applications with large inputs |
| Robustness of results | The location of a membership function |
| Highly generalization capability | The curse of dimensionality |
Strength and limitation of DNN model
| Strength | Limitation |
|---|---|
| Works better with big data | Requires more data to work with than regular ML |
| Efficient computational power | Expensive computational as it has high complexity |
| The algorithm implemented runs faster | Black box nature |
Comparison of ANFIS, DNN, and hybrid model of ANFIS with DNN
| Techniques | ANFIS | DNN | ANFIS + DNN |
|---|---|---|---|
| Dimensionality | High | High | High |
| Robustness | High | Low | High |
| Generalization capability | High | High | High |
| Big data | Low | High | High |
| Accuracy | High | High | High |
| Computational cost | High | High | High |
| Complexity of black box nature | Low | High | Low |