| Literature DB >> 33564213 |
Iman Rahimi1, Fang Chen2, Amir H Gandomi2.
Abstract
The novel coronavirus (COVID-19) has spread to more than 200 countries worldwide, leading to more than 36 million confirmed cases as of October 10, 2020. As such, several machine learning models that can forecast the outbreak globally have been released. This work presents a review and brief analysis of the most important machine learning forecasting models against COVID-19. The work presented in this study possesses two parts. In the first section, a detailed scientometric analysis presents an influential tool for bibliometric analyses, which were performed on COVID-19 data from the Scopus and Web of Science databases. For the above-mentioned analysis, keywords and subject areas are addressed, while the classification of machine learning forecasting models, criteria evaluation, and comparison of solution approaches are discussed in the second section of the work. The conclusion and discussion are provided as the final sections of this study.Entities:
Keywords: Analysis; COVID-19; Forecasting; SEIR; SIR; Time series
Year: 2021 PMID: 33564213 PMCID: PMC7861008 DOI: 10.1007/s00521-020-05626-8
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.102
Fig. 1Classification of scientific papers based on subject area
Fig. 2Research methodology used in this paper
Parameter settings
| Parameter | Value |
|---|---|
| Minimum number of occurrences | 1 |
| Criterion met | 1931 keywords |
Fig. 3Networks across the links (keywords analysis)
Fig. 4A detailed analysis (sum of works cited and number of records vs. Affiliations)
Fig. 5Susceptible, infected, and recovered (SIR) model
Fig. 6The susceptible, exposed, infected, and recovered (SEIR) diagram [18]
Proposed solution approaches for forecasting coronavirus 2019 (COVID-19)
| Algorithm | ||||||||
|---|---|---|---|---|---|---|---|---|
| Epidemic models | Time-series models | Nature-inspired algorithms | ||||||
| SIR | [ | Autoregressive model | Moving average | Autoregressive integrated moving average [ | Genetic programming [ | |||
| Simple moving average [ | ||||||||
| Other models | [ | |||||||
| SEIR | [ | Exponential models | Logistic growth model [ | Flower pollination algorithm [ | ||||
| SIRD | [ | Deep learning | Long short-term memory (LSTM) networks [ | Polynomial Neural Network [ | ||||
| Neural network [ | Ecological Niche models [ | |||||||
| Regression methods | [ | |||||||
| Prophet algorithm | [ | |||||||
| Phenomenological model | [ | Other models | Adaptive neuro-fuzzy inference system [ | Regression tree algorithm [ | Support vector machine [ | Iteration method [ | Support vector Kuhn-tucker [ | |
Fig. 7% of contribution of different solution approaches applied in the forecasting of COVID-19 confirmed cases
Strengths and weaknesses of proposed machine learning algorithms
| Algorithm | Strength | Weakness |
|---|---|---|
| Artificial neural network | Could access several training algorithms [ | The nature of being a black box [ |
| Support vector machine | Can avoid overfitting and defining a convex optimization problem [ | Choice of the kernel as well as speed and size of training and testing sets [ |
| Compartmental models (SIR, SEIR, SIRD, etc.) | Predict how the disease spreads Present the effects of public health interventions on the outcome of the pandemic [ | The proposed models are mostly deterministic and work with large populations [ |
| Nature-inspired algorithms (genetic programming) | Intelligent search [ Can integrate with certain decomposition algorithms [ | Several parameters should be set by the decision-makers The algorithms are approximate and usually nondeterministic [ |
| Prophet algorithm | Are robust in dealing with missing data [ | It is hard to use the algorithm for Multiplicative models Predefined format is needed for data before using the algorithm |
| ARIMA | Works for seasonal and nonseasonal models Outliers can be handled well | Changes in observations and changes in model specification make the model unstable [ |
| Deep learning | Results comparable to human expert performance [ | Requires large amounts of data The training process is expensive |